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COMPLEMENTED MODULAR LATTICES 
ICHIRO AMEMIYA! anv ISRAEL HALPERIN 


1. Introduction. 


1.1. This paper gives a lattice theoretic investigation of “‘finiteness’’* and 
“continuity of the lattice operations’ in a complemented modular lattice. 
Although we usually assume that the lattice is X-complete for some infinite X,* 
we do not require completeness and continuity, as von Neumann does in his 
classical memoir on continuous geometry (3); nor do we assume orthocom- 
plementation as Kaplansky does in his remarkable paper (1). 


1.2. Our exposition is elementary in the sense that it can be read without 
reference to the literature. Our brief preliminary § 2 should enable the reader 
to read this paper independently. 


1.3. Von Neumann’s theory of independence (3, Part I, Chapter II) leans 
heavily on the assumption that the lattice is continuous, or at least upper 
continuous. We do not assume such continuity and we find it necessary 
therefore to distinguish several concepts of independence for a family of 
elements {a,;’ € A}: independence shall mean that a> -(a,;4 € F) =0 
whenever F is a finite subset of A and \ ¢ F; residual independence shall mean 
that a, >> (a,; « # A) = O for every A; and strong independence shall mean that 
IL, D(a; ¥ d) = 0. 

Strong independence is sufficiently restrictive that, even without assuming 
continuity of the lattice operations, many of the continuous geometry argu- 
ments remain vaild. For example, if {a,, b,;’ € A} is strongly independent 
and for each \ there is given a perspective mapping of [0, a] onto [0, |], 
then these mappings can be. imbedded in a single perspective mapping of 
[0, >-a,] onto [0, >>]. ws 


§ 3 is devoted to a discussion of independence. 


1.4. Suppose L is complemented, modular, and countably complete. Von 
Neumann's arguments (3, Part I, Theorem 4.3) show that L is finite, that is, 


Received April 29, 1958; presented to the American Mathematical Society, January 29, 1958. 

1Post-doctoral Fellow (of the National Research Council of Canada) at Queen's University, 
on leave of absence from Tokyo College of Science. 

*In this paper, a lattice is called finite if every independent sequence of pairwise perspective 
elements has all its elements coinciding with zero. Sequence shall mean infinite sequence through- 
out this paper. We note however that finite families {a;,...,a@n,} of pairwise perspective 
elements were used by von Neumann in his coordinatization theory (3, Part II, chapter III) 
and play a key role there. 

*Throughout this paper N denotes an arbitrary (but fixed) infinite cardinal (that is X > No); 
Q denotes the least ordinal number whose corresponding cardinal power is X 
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an independent sequence {a,} of pairwise perspective elements with a, ~ 0 
cannot exist, if the lattice is No-continuous (this means: the lattice is both 
upper No-continuous and lower No-continuous). 

If No-continuity does not hold, then such sequences {a,} can occur. But 
we find the paradoxical result: the existence of such sequences actually forces a 
certain type of continuity to hold. This situation is described more precisely 
in the following paragraph. 

A homogeneous sequence is defined to be a strongly independent sequence 
{a,} of pairwise perspective elements. We draw attention to two important 
special cases: 

(i) Type (A): all a, have a common complement, that is, for some element 
A,a,@A =1. 

(ii) Type (A*): all a*, = >> (a,; m # n) have a common complement.‘ 

In § 5 we show: Suppose {a,} is a homogeneous sequence; then the lattice 
[0, >-a,] is upper No-continuous if and only if {a,} is of type (A), lower No- 
continuous if and only if {a,} is of type (A*). Thus, if {a,} is of both types (A) 
and (A*), the above-mentioned result of von Neumann shows that all a, 
must be 0. 

In § 8 we show that the types (A) and (A*) are mutually exclusive in a 
stronger sense, namely: If {a,} and {d,} are homogeneous sequences of types 
(A) and (A*) respectively, then }>a, and }-d, are completely disjoint (this 
means: @ perspective to 6 with a < }-a, and 6 < }d, can occur only when 
a =6=0). On the other hand, these two types are exhaustive in the 
following sense: every homogeneous sequence {a,} has a unique decomposition 
a, = b, +c, with {b,} a homogeneous sequence of type (A) and (c,) a 
homogeneous sequence of type (A*). 

From these facts about homogeneous sequences we can deduce (see § 8): 
If L is complete then L has a direct sum decomposition L = L; + L2 + Ls 
where L, = (0, a,) with each a, in the centre of L, and with L; upper No- 
continuous, L, lower No-continuous and L; finite. 


1.5. Now suppose L is even X-complete for a given infinite X. We call L 
locally X%-continuous if for every a ~ 0 there exists some 0 ¥ a; < a with 
[0, a,] N-continuous. We show (see Corollary 1 to Theorem 7.1): If L is 
locally N-continuous and finite then L must be XN-continuous. 


1.6. In §§ 4, 6 we establish, among other properties of finiteness and con- 
tinuity, that they are additive, that is, if [0, a] and [0, 6] enjoys one of these 
properties then so does [0, a + 5]. 


1.7. Finally, in § 9 we prove theorems somewhat more general than that 
of Kaplansky (1). Kaplansky proved: (i) every countably complete ortho- 


‘If (a, . . . , @m) is a finite independent family of pairwise perspective elements, then (i) and 
(ii) both hold, that is, the a, have a common complement and the a*, have a common 
complement. 
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complemented modular lattice is finite and (ii) every complete orthocom- 
plemented modular lattice is necessarily continuous. 

Our work gives lattice theoretic proofs for generalizations of both of these 
results. In particular, (ii) is strengthened to (ii)’ every N-complete ortho- 
complemented modular lattice is N-continuous. 

More generally, we prove, generalizing (i): 


THEOREM 9.1. A countably complete, complemented modular lattice is finite, 
if (*): for every a # O there exists an anti-automorphism @ of L such that b 
perspective to a subelement of a occurs for some b # 0 with b ¢(a) = 0. 


(*) holds, for example, if L possesses an orthocomplementation, or even, in 
the case that L is complete, if L possesses an anti-automorphism which is an 
orthocomplementation on the centre of L (see Corollary 1 to Theorem 9.1). 

We prove, generalizing (ii)’: 


THEOREM 9.5. An X-complete complemented modular lattice is X-continuous 
if it is finite and possesses an anti-automorphism ¢ of period two with the follow- 
ing continuity property: (**) for every limit ordinal number Q; < Q, x3 + (xs) =1, 
xpo(xs) = 0 for all B <Q, and xg < x, for all B < y <Q; together imply 
(Six) + (Dox) = 1, (Oxs)o( x5) = 0. 


Clearly every orthocomplementation ¢ has the property (**). 


1.8. An alternative (but still lattice theoretic) proof of the Kaplansky's 
finiteness theorem for the orthocomplemented case (see (i) in § 1.7 above) is 
given in an Appendix. This Appendix can be read independently of the rest 
of this paper and it is somewhat related to Kaplansky’s original method. 


2. Preliminaries. 


2.1. Let L be a set of elements partially ordered by a relation a < 6 (written 
also b > a). By definition, partial ordering means: a < b, b < c imply a < ¢, 
and a < 6, 6 <a hold if and only if-a = 5 (that is, a and 6 are the same 
element). 

When a, is in L for each \'€ A we call a the union of the a, and write 
@ = > yad (or Sa,) if a is an element such that: x > a, for every \ is equiva- 
lent to x > a. We call a the meet of the a, and write a = I],,,a, (or Ila) if 
a is an element such that: x < a, for every \ is equivalent to x < a (each of 
union and meet is clearly unique if it exists at all). 

The zero (unit) in Z written as 0(1), is defined to be the element (if it exists) 
such that 0 < x(x < 1) holds for all x in L. 

The dual to any statement or construction concerning elements of L is 
obtained by replacing < by >; E, II byII,¥ respectively and 0, 1 by 1, 0, 
respectively. L’ denotes the partially ordered set dual to L. Any theorem 


implies its dual. 


5This section is mostly based on the original material of J. von Neumann (see (3) or (2) ). 
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L is called complete if a,, [a exist for all families {a,;\ € A}; N-complete 
if these elements exist whenever A < &;;° a lattice if it is 2-complete (hence 
n-complete for every nm = 2,3,...). 

A lattice is called modular if a(b + c) = 6 + ac whenever a > b (equiva- 
lently, if: a(6 + c) = a(b(a + c) +c) for all a, b,c). 

When a < 5b we write L(a, 6) or [a, 6] to denote the sub-partially-ordered 
set of all x with a < x < 5; clearly, it has a, b as zero and unit respectively. 


2.2. Let L be a lattice with zero element. Elements a, 6 are called disjoint if 
ab = 0 (@ shall mean + but shall imply that the summands are disjoint). 

If a < c, [ce — a] will denote any element A, to be called a complement of 
a in c (sometimes called a relative complement of a in c), for which a@ A = c. 
If ZL has a unit, [1 — a] (if it exists) is called a complement of a. 

L is called complemented if 0, 1 exist in L and every a has at least one com- 
plement. L is called orthocomplemented if 0,1 exist in L and L possesses an 
anti-automorphism @ of period 2 with ¢(a) @ a = 1 for all a. 

If L is complemented and modular, a relative complement [c — a] exists 
always (c{l — a] will do); then, whenever ab = 0 there exists a complement 
A of a with A > b (indeed, 6 + [1 — (a + d)] will do for A). 

If L is modular and A is a complement of a then [0, a] and [A, 1] are lattice 
isomorphic under the mutually inverse mappings: 


(2.1) a4,74,+A ifa,;<a; A, —-aA,if A, >A. 

2.3. Let L be a modular lattice with zero element. The elements a and b are 
called perspective with axis x (we write a~ db), if a@ x = b6@ x; we may 
replace x by x(a + b) to obtaina@ x = b@x=a+b. 


If a, b are perspective with axis x then [0, a] and [0, 5] are lattice isomorphic 
under the mutually inverse perspective mappings: 


a4, (a,+ x)bifa, ca; b7(b4+ xaif bi < b 
(clearly, a; ~ 5; with the same axis x). We note: 
(2.2) a~c,c~b, (a+c)b =0 imply a~ 5; 
for ifa@x=x@c=a+candc@®y=yOb=c+5 then 
a@® (x + y)(a + b) = b@ (x + y)(a + BD). 


Elements a, 6 are called projective (we write a = b) if a = a, and b = a, 
for some finite family a;,...,@, with a, ~ a4; for i < m. 
We shall say that an element a can be doubled in L if 


(2.3) a ~ u holds for some u in L with ua = 0. 
If a modular lattice L with zero has a unit, we shall say the lattice L can 


®A denotes the cardinal power of A. For every family of lattice elements {a,; € A} which 
we consider, we shall suppose that A < %. Von Neumann defines L to be an XX -lattice if it is 
Ni-complete for every NX, < N (see (3, Part Ill, Definition A. 1) ). 
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be doubled if there exists a modular lattice L; with zero such that for some u 
in Ly, L is lattice isomorphic to (0, u] and u can be doubled in L;. Clearly if 
a can be doubled in L, the lattice [0, a] can be doubled. 

We write (as in von Neumann (3, Part II, Definition 3.4)), (a, 6, x)C to 
mean: a@@x=b@x=a@®@ b. 

We call a and b completely disjoint and write (a, b)P to mean: 


(2.4) a, ~ b;, a; < a, 6; < b together imply a; = 0. 
Clearly (a, 6)P implies ab = 0. 

We say: 
(2.5) a is in the centre of L if (a, b)P holds whenever ab = 0. 


2.4. Let L be a complemented modular lattice. Then (a, 6)P holds if and only if: 
(2.6) every complement of 6 contains a. 


(Suppose (2.6) fails: if B is a complement of 6 with B >a false, then 
a, = [a — aB) ~ O and a, ~ (B + a,)b (with axis B) so (a, b)P does not 
hold. Suppose, on the other hand, (2.6) does hold: then if a; < a, b; < 8, 
and a; ~ 6; with axis x, we have in succession: a,b) < ab = 0; b(a; + b;)x = 0; 
there exists a complement B of 6 with B > (a; + 6;)x; B > (a; + b:)x + ay; 
B > bi; b; = 0; a; = 0; hence (a, 6)P holds.) 

(2.6) is also equivalent to: every complement of a contains b (consequently, 
a is in the centre of L if and only if it has a unique complement, necessarily 
also in the centre of L, and a is in the centre of L if and only if it is in the 
centre of L’). 

Hence in a complemented modular lattice, if (a, ))P holds for every \, 
and >>) exists, then every complement of a contains }>) along with all 
b, so (a,>-b,)P holds; therefore, if }-a, and >>b, both exist and (}°a,,}0b,)P 
is false, we must have (qa, 5,)P false for some particular A, yu. 

Consequently, although this fact is not needed in the present paper, if d) 
are all in the centre of L then 30d, if-it exists, is also in the centre of L and, 
by duality, Ild,, if it exists, is also in the centre of L. 

If, in a complemented modular lattice, (a, b)P is false and ’~c then 
(a, c)P is also false; this follows from: 


(2.7) a~b,b~c, a #0 together imply a; ~ c, for some a; < a, 1 < € 
with a, ¥ 0. 


Clearly, we need prove (2.7) only for the case ac = ba = bc = 0. Because 
of (2.2), we may also suppose a < b+ c¢, c< a+b.’ Now it follows that 
a@®b=c@bsoa~mc (axis D). 

Hence, in a complemented modular lattice, (a, b)P holds if and only if 


"For example, if a < b + ¢ is false then a’ = |a — a(b + c)] #0, and a’ ~ b’, b' ~c’ for 
some b’ < band c’ < c. Since a’ (b’ + c’) < a’ (b +c) = 0, it follows from (2.2) that a’ ~ c’. 
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ac = 0 whenever c ~ b. Indeed, (a, })P and b~c imply ac = 0 by (2.7). 
On the other hand, if ac = 0 for all c ~ 5 then (a, 6)P does hold; for then 
ab = 0, and if a; < a, b; < b with a; ~ 5; we have b ~ (a; + [b — d;]) (use 
(2.2)); since a(a, + [6 — b,]) = a;, then we must have a; = 0, proving 
(a, 6)P does hold. 

If L is complete then for each element a@ there exists a central element 
z > a (namely, z = > a’ for all a’ perspective to subelements of a) such that: 


(2.8) (a, b)P holds if and only if 2b = 0. This z is the least element in the 
centre with property z > a. 


2.5. Let L be an X-complete lattice. A family {a,;\ < Q,} with Q; < Q, either 
increasing, that is, A < w implies a, < a, (written a, Ta to denote also 
a = >a) or decreasing, that is, \ < u implies a, > a, (written a, | a to denote 
also a = I]a,) is said to converge continuously if for every x, 5 (xa,) = xa or 
II (x + a,) = x +a, respectively; L is said to be upper N-continuous or lower 
X-continuous if every such increasing or decreasing family respectively, con- 
verges continuously (upper N-continuity of L is clearly equivalent to lower 
X-continuity of L’.* 

If a, T continuously, then for every c, ca, T continuously; indeed, for every 
x, x(d-ca,) = cx(Diam) = Do (ex)a, = Dox(ca). 


L is called X-continuous if it is both upper and lower N-continuous. 


2.6. Let L be a complemented, modular, and X-complete lattice. If {a,} is 
increasing or decreasing, then {a,} does converge continuously if: xa, = 0 
for every \ implies x}>a, = 0 or if x + a, = 1 for every \ impiles x + Ila, = 1, 
respectively. Also, L is upper XN-continuous if a, 71 implies a, converges 
continuously, lower X-continuous if a, | 0 implies a, converges continuously. 


2.7. Let L be an X-complete lattice with zero. L is called locally X-continuous 
(upper X-continuous, lower N-continuous) if a # 0 implies [0, a,:] is N-con- 
tinuous (upper N-continuous, lower N-continuous) for some 0 # a; < a. 

If L is also complemented and modular, then L is locally XN-continuous 
(upper X-continuous, lower N-continuous) if and only if the dual L’ is locally 
X-continuous (lower X-continuous, upper N-continuous); for if A # 1, let 
a be a complement of A. Then a # 0, and [0,a,] is N-continuous (upper 
N-continuous, lower X-continuous) for some 0#a,;<a. Let A;=A+[a—a)]. 
Then A < A; # 1 and [A;, 1] is N-continuous (upper N-continuous, lower 
N-continuous) by (2.1). This shows that L’ is locally X-continuous (lower 
N-continuous, upper N-continuous) since < in L means > in L’. 


8Transfinite induction shows that L is upper N-continuous if and only if, for arbitrary 
{a,;’ € A} and for every x, Zp(xZ(a,; € F)) exists and equals xZ a, (F varies over all 
finite subsets of A), lower X% -continuous if and only if, for arbitrary {a,;\ € A} and for every 
x, p(x + I(a,; € F)) exists and equals x + Ila. 

An equivalent definition of continuity in terms of directed families {a,} was given by 
U. Sasaki who used a lemma of T. Iwamura (see (4) or (2, Appendix I1) ). 
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If [0,a,] is N-continuous (upper XN-continuous, lower N-continuous) for 
every \ then [0,>¢a,] must be locally N-continuous (upper X-continuous, 
lower N-continuous); for if x #0 and x < dca then (x,a,)P is false for 
some A, so (0, x,] is lattice isomorphic to [0, a,’] for some a,’ < a, and some 
x, # 0 with x, < x. But (0, a’] is X-continuous (upper N-continuous, lower 
X-continuous) along with [0, a}, so [0, x,] has the same property. This proves 
that [0, }-a,] is locally N-continuous (upper X-continuous, lower N-continuous). 


3. Independence theory. In this section we assume L is an X-complete 
modular lattice with zero. Since we do not make any continuity assumptions 
we need to refine the von Neumann independence theory.* in so far as it 
applies to infinite families of elements. In particular, in Theorem 3.1 below, 
we use a complementation argument to replace the usual “‘continuity”’ argu- 
ment. 

If {a,;’ € A} is a set of elements in L we use the following notation: 


a, denotes 5 (a,;u ¥ X), 
ar denotes )(q;A €T) if TCA, 
ar denotes S(a;A¢T) if TC A(in particular, a, = 0). 


Definition 3.1. A family {a,;A € A} is called independent if aay = 0 when- 
ever F is a finite subset of A and d ¢ F; residually independent if aa, = 0 
for every X. 


Definition 3.2. If {a,} is residually independent the residual element of 
{a,} is defined to be [lax; an element x is called a residual element in L (more 
precisely, am N-residual element in L) if x is the residual element of some 
residually independent family {a,;\ € A} with A = X&. 

If {a,} is residually independent with residual element 0 then {a} is called 
strongly independent. 

Because of the modular law, the following statements follow easily: 

Independence of {a,} is equivalent to: ayag = 0 whenever F, G are finite, 
disjoint subsets of A, and also.to: 


II, (qm; € F,) = D(a € ON F,) 


for every finite collection of finite subsets F, of A. 

Residual independence implies independence and is equivalent to: arar = 0 
for every finite subset F of A. 

Strong independence implies residual independence and is equivalent to 
the single condition Ia* = 0. (It will follow from Theorem 3.1 below that 
strong independence of {a,} is equivalent to: for every collection of subsets 
I, of A, ILS (aq; € I,) exists and equals 5 (a,; 4 € O J,).) 


*The von Neumann theory of independence can be found in (3, Part I, chapter I). 
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If a,(a; +... + a,1) = 0 for i > 2, then the finite or infinite family 
{a,;” > 1} is independent. Hence: 

if {b;,..., bm, 1, ..., @,} is independent and c, +... + & < bi +... + dm 
and {ci,...,¢,} 1s independent, then {c,,..., ee a,} ts independent; 
a generalization of this fact is proved in the Corollary to Theorem 3.1. 

If db, < a for each A and {a} is independent (residually independent, 
strongly independent), then {d} is independent (residually independent, 
strongly independent). 

If {b,, &; A © A} is independent (residually independent, strongly indepen- 
dent) then {db + q} is independent (residually independent, strongly inde- 
pendent). 

{a,;” = 1,2,...} is residually independent if and only if a,>— (a,,; m > n) =0 
for every nm = 1,2,..., strongly independent if and only if residually inde- 
pendent with IL. (¥ (an; m>n)) = 0. If {a,} is strongly independent, then 


(3.1) a<>d 6 a(S o.) =0 


t=n i=n+1 


for every imply {c,} is strongly independent. 

If {a,},{d,} are both independent (residually independent) and (S~a,) (}>d,) =0 
then {a, + d;A € A} and {a,,d;A € A} are both independent (residually 
independent) (the Corollary to Theorem 3.1 below shows that if {a,},{d,} are 
both strongly independent with (}°a,)(5-)) = 0 then {a, + &;A € A} and 
{a,, bs; € A} are both strongly independent). 

If L is upper N-continuous then independence implies strong independence 
for families {a,;4 € A} with A < & (this was shown first by von Neumann 


(3, Part I, Chapter I1)). 


THEOREM 3.1. Suppose {a,} is strongly independent and for an arbitrary set 


of u, Oy.» < a for all , wu. Then I1,a,., exists for each d if II, (Say,,) exists. On 
the other hand, 


TLS, Qy,») — > dL, Ay») 


(that is, both sides exist and are equal) provided that for each i, the element 
Ila, exists and has a complement in a, (in particular, if Il,a,,, = 0 for 
every i). 


Proof. 1. If Il,a,,, exists let it be denoted as dy. 
Clearly, if I1,(S,a,,,) exists, then for each », 


aJI,(X)a,,) =II,(a,,, + 0) 


since {a,} is strongly independent, so b, exists for each v. 


2. Clearly }>,a,,, > Sab, for every uw. So we need only show: if for some 
x, >a, 4 > x for all uw, then x < }yd. 
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But if for all u, ,a,,, > x then for all A, wu, 
x < ay. + a, 
(x + ax)ar < (Grn + ar)ar = ah,5; 
then for every \X, 
(x + ax)a, < Ia, = dr, 
x<xt+a,<h +a; 
finally, 
x < II,(b, + a3). 
Thus the theorem will be completely proved if we establish 
I1,(d, + a.) = Labs. 
3. Now suppose c,@ b, = a, for each \. Then 
Dh + La > I( + ax) > Dh, 
and the modular law now shows that we need only prove 
(I1(, + ax))(La) = 0. 
And this does hold because (b, + a*,)(oa) < a*, for each d, and IIa*, = 0. 


COROLLARY. (Strong independence under substitution). Suppose L is an 
X-complete modular lattice with zero, {a,; € A} is strongly independent and 


r, A,... are mutually disjoint subsets of A. If {c,}, {d,},... are each strongly 
independent with sets of indices u,v, . . . each of cardinal power < Nand ay>>dc,, 
a, > Dd d,,..., then the set of all elements {all c,, all d,,...} is strongly inde- 
pendent. 


Proof. Since a*;y <I1(a*,;\ € IT) the meet a*pa*, ... <I] (a*,; € A) = 0. 
So {arp, @4,...} is strongly independent. 

To prove fall c,, all d,,...} strongly independent we form the union }>* 
of all c,, all d, omitting one of these elements and we need only prove that all 
such >>* have 0 as meet. 

But if c, is omitted, }>* = c*, + a*p. Then Theorem 3.1, applied to the 
family {ayp,a,,...} shows that 


II >* = Ic +01 +...=04+0+...=0. 


Remark. In the case that L is complemented Theorem 3.1 is equivalent to 
the statement: if {a,} is strongly independent, then the set Lo of all 0x, with 
x, < a, is a sublattice of L, isomorphic by the correspondence })x +> {x} 
to the direct product of the lattices [0, a,]; Lo has the property that if any 
family of elements in Zo has a union or meet in L, then this union or meet 
is in Lo. 

At this point we introduce an important generalization of the conjointness 
relationship ‘‘(a, b, c)C’’ of von Neumann (see § 2). 
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Definition 3.3. A family of ordered triplets { (x, a,b), € A} is called a 
C-system (more precisely, an X C-system if A = &, and sometimes a C-sequence 
if A is countable) if: 

(i) (x,, &, b,)C for every \, as defined in § 2; 

(ii) {d>x,, db; 4 € A} is strongly independent. 

We shall write: {(x,, a, db); A € A}C to denote that {(x, aq, bd); A € A} isa 
C-system. 

Clearly, if x denotes }>x, and { (x), c, b,)}C, then: 

(iii) x, = x(c + d) for every 4X, 

(iv) x® q = x@® Jd for every *. 

(v) x<LatLh. 

Conversely, if some given x, {b,}, {q} satisfy (iv), (v), and 

(ii)’ {x, b; A € A} is strongly independent then, with x, defined by (iii), 
it is so that { (x), q&, b,)}C holds. 

Thus in a C-system the x, are uniquely determined by the elements 
{cx, bx; A € A} and the union }>x,. We shall sometimes write { (x|c, b,)}C with 
x =)>ox in place of { (x, a, d)}C. 


Lemma 3.1. If { (x|c,, b,)}C holds, then {c,} is residually independent and has 
residual element x>_ cy. 


Proof. 
<b +x. 
Hence, (by (iv), Definition 3.3), 
Ocn < (by + x) (DX + x) = Ox 
and (by (iii), Definition 3.3), 
QxXx = OX = 0. 

Thus for each A, qc*, = 0 and hence {cq} is residually independent. Next, 

xa = x( + a)(G +x) = xl + alco +x)) = x(f + c,(b; + x)). 


But {d,, 5*,, x} is independent, so, by the Corollary to Theorem 3.1, {c,, 5*,, x} 
is independent; thus c,(d*, + x) = 0 and 


xDo=xc, for each yp. 
Thus xq, = xlI,c*, < (residual element of {c,}). On the other hand, 
(residual element of {c,}) = Ile. < T(, + x) = x 
since {x,b,;X € A} is strongly independent. Thus 
(residual element of {a}) < (SLa)x 


and so equality holds. 
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Definition 3.4. A C-system {(x,, &, b,)} is called a residual C-system, if 
<x = (residual element of {c}) 
equivalently (by Lemma 3.1), if 2. < Dia. 


Remark. It is easy to see that a residual C-system with A finite must have 
all x,, &, , identically 0. But a non-trivial residual C-system can be con- 
structed whenever there exists an increasing sequence {a,} which does not 
converge continuously (this will follow immediately from Theorem 3.6 and 
the Corollary to Theorem 3.2 below). 


THEOREM 3.2. If {a,} is residually independent and a, > b,® c for every 
A, then the residual elements a, b, c (of {ay}, {db}, {cx} respectively) satisfy: 

(i) (Sd) (Sa) = be; 

(ii) a > b + c with equality if a, = b, @ c for every d. 


Proof. Each of {bh}, {ce} is residually independent along with {a,}. Now: 
(i) For each fixed yw, {a*,, b,, ¢,} is independent. Hence 


(Ch)(Xa) = bes (Ch) (Xa) = (11,8) ILS) = bc. 


(ii) @ > 6 +c is clear. But if a, = 5, @ cq for every \ then alsoa < b+ c 
for: 


(@ + Dh)(Xa) < (6, + a:)(La) = &. 
Hence 
c> (a+ Yh)(Xo), 
(Ch) +¢> (@+ Dh)(la + Ch) >a. 
Similarly (S>q) + 6 > a. Thus 
a < (Dh) + 0)((La) + 6) = 6 +064 (Lh)(La) = d+. 


Coroiiary. If L is complemented and {ay} is residually independent with 
residual element x, there exists a residual C-system {(x\c,, b,)} with a < a. 


Proof. 1. Choose X to be a complement of x and define: 
oO = [a — aX], 
by = (x — C,)X. 


2. Then a, = aX @ cq and each of {aX}, {cq} is residually independent 
since {a,} is residually independent by hypothesis. 
But 


(residual element of {a,X}) < (residual element of {a,}) = x; 


also < X, so < xX = 0. Now Theorem 3.2 shows that {ca} is residually 
independent with x as residual element. 
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3. {x, 4; € A} is strongly independent because 
ik <xt+a =a, 
lox < Ilex = x. 
But I1s*, < X, hence I14*, < xX = 0. Therefore {d,} is strongly independent. 
Since x}(b, < xX = 0, the Corollary to Theorem 3.1 shows that {x, b,;4 € A} 
is strongly independent. 
4. ax {ax = 0, he = 0 and 
bh @ x = (x® q)(X +x) = x OQ. 
Finally, >}, + a > x since, as we have already shown, {cq} is residually 


independent with x as residual element. Thus (iv), (v), and (ii)’ of Definition 
3.3. hold and it follows that { (x|c,, &,)} is a residual C-system, as required. 


THEOREM 3.3. Suppose {a} is residually independent with residual element t. 
If t = x@®_yand Y is an element with Y > x and Y® y > dq, then 
{Y(aq, + y)} 
is residually independent with residual element x. 
Remark. If L is complemented, Y could be chosen to be x + [(Sia,) — ¢]. 
Proof. Put bh = Y(a, + y). Then 
ht+y=antyrk +y=a+y =a, 
bydy < Ya; (ay +y)= Y (aya, +y) = Yy = 0. 
Thus {d} is residually independent. 
Now 5*, < Y, and Yy = 0, so 6*, = Y(b*, + y) = Ya*,. Hence 
(the residual element of {d}) = Il(Ya*,;) = Yt=x+ Yy = x. 
Coro.iary 1. Jf L is complemented then every subelement of an X-residual 


element is also an X-residual element. 


CorROLLARY 2. If L is complemented and {ay} is residually independent there 
exists a strongly independent family {b,} with >>b, < da, and by, perspective to 
a, for every d, with a common axis of perspectivity. 


Proof. Let t be the residual element of {a,} and choose Y = [(S-a) — 4], 
that is, let x = 0, y = ¢ in Theorem 3.3. Then }, = Y(a, + y) satisfies our 
requirements and for every \, d, is perspective to a, with axis Y. 


THEOREM 3.4. Additivity of perspectivity. Suppose {a, + hb; € A} ts 
residually independent and a, ~ by for every d. If {ay} and {b,} are both strongly 
independent (in particular, if {a, + b,} is strongly independent),'° then there 


Strong independence of {a, + b,} obviously implies that of each of {a,}, {b,}. The 
interested reader can verify, using Theorem 3.1 and Theorem 3.2, that if L is complemented 
then residual independence of {a, + 5,} together with strong independence of each of {ay}, 
{b,}, actually forces {a, + )} to be strongly independent. 
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exists a perspective mapping of |0,5—b,] onto [0,>-a,|] which maps b, on a, for 
each d. If {by} is strongly independent and L is complemented then > by is per- 
spective to a subelement of > ay. 


Proof. Suppose a, @ x, = 5, @® x and {a}, {d,} are both residually inde- 
pendent. Then for every fixed yu, (a, + 5,)(a*, + 5*,) = 0. Hence 


(Xx) (Cha) = xb, + xpbs = O + xpd; < by; 

(=x)(Xb) < (residual element of {d,}). 
Similarly, 

(Sx) (Sa) < (residual element of {a,}). 


Thus if {d,} is strongly independent, >>), is perspective to [}->a, — (}>x)(S-a)] 
with axis }>x. If {a,} is also strongly independent then (}>x»)(}-a) = 0; 
in this case (}>b,) ~ (Sa) with axis >», and the corresponding perspective 
mapping maps ), on a, for each X. 


Remark. If (3°b,)(S-a,) = 0 and {d} is strongly independent, then residual 
independence of {a, + 4} is equivalent to residual independence of {a}, by 
application of Theorem 3.1. 


COROLLARY. Suppose {a}, {b,} are both strongly independent families. 

(i) If a, ~ b, for each X and (Sa,)(>-b)) = O then there is a perspective 
mapping of [0,>-a,] onto [0,5°b,] which maps a, on by, for each x. 

(ii) If a, = b, and L can be doubled then there is a lattice isomorphism of 
[0, 5-a,] onto [0,>°b,] which maps a, on by for each d; if alsoy°-b, < Sa, and L 
is complemented then [>-a, — >-b,] is a member of an independent sequence of 
mutually perspective elements. 


Proof of (i): Theorem 3.4 shows this since {a, + },} is strongly independent 
by the Corollary to Theorem 3.1, under the present hypotheses. 


Proof of (ii): We may suppose that-L = [0, c] with c an element in a modular 
lattice L; such that [0, c] can be mapped by a perspective mapping ¢ onto 
0, u] for some u in L, with cu = 0. Then a, ~ $(d) for each A (by repeated 
applications of (2.2)). 

Since (Sa,)(5¢(h,)) = 0, there exists, by (i) above, a perspective mapping 
¥ of [0,>°a,] onto [0,>°>¢()] which maps each a, on $(d). 

Now ¢~'y is a lattice isomorphism of [0,>-a,] onto [0,5°] as required. 

If finally >}, < Soa, and L is complemented, let x; = [()0a,) — (A )]; 
and for nm > 1 define x,4; by induction: 


Xn+1 = op (xp). 


Then {x,} is an independent sequence since x,(}>(x»,;m > n)) = 0 (this 
follows from repeated applications of ¢~'y to the identity x,(>0 (x,; m > 1)) < 
x1(>-b) = 0). 
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Since x, ~ $(x,) and $(x,) ~ Xn41 and {x,, 6(x,), Xn41} is independent for 
each n, therefore x, ~ X41. Then by (2.2), x, ~ x» for all m, m. This proves (ii). 


THEOREM 3.5. Extension of perspective mapping. 

Suppose { (xy, Cr, b,)} and {(x,’, cy’, dy’)} are both C-systems and }>x < x, 
Lm’ < x’ and {x + x’,>b,>d,’} is independent. Then any perspective map- 
ping of |0, x] onto (0, x’] which maps x, on x,' for every , can be extended to a 
perspective mapping of (0, x + 5d] onto [0, x’ + >-b,'] which maps b, on by’ 
and c, on c,' for every X. 


Proof. 1 The given perspective mapping of [0, x] onto [0, x’] is determined 
by some axis of perspectivity a with: 


x@a=x'@a=-x+-’. 
2. We shall choose y, below so that 
(3.2) wn Or=n OK = bh Oh; 


it will then follow immediately, as in the proof of Theorem 3.4, that the axis 
a+>y gives a perspective mapping of [0,x +] onto [0,x’ +d’) 
which fulfills all our requirements except possibly for the requirement: 
(3.3) c should be mapped onto a’ for each X. 
3. Our choice of y is: 
w= (@+ 4 + )(d, + Hy) 
and we verify that (3.2) holds, as follows: 


(i) Da + be = (2 + dy + oy + ch) (dr, + Hf) 


and 


@tht+atagazratathat+aqzratmytat+aqzratmt hat k 
so nth =h+ dy’. 

Similarly Vr a b,’ = by + db,’ sO Vr + by, = 7, + b,’ = oN + by’. 

(ii) The hypotheses imply that {4, d,’,x + x’} is an independent family 
for each 4. Now successive applications of the Corollary to Theorem 3.1 show 
in turn that each of the following is independent: 


{by, bx, xx, a}, { by, ch, xh, a}, 
{by, Ch, %, a}, {Dy, ch, Cr, a}. 

Therefore 
hyn = @+at+ayh = 0. 

Similarly 


DY = 0. 
(i) and (ii) prove that (3.2) holds. 
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4. Finally, we verify that a + }-y does satisfy (3.3), as follows: 


(the map of q) = (¢ + Dy, + a) (x’ + Hj) 
>a@tntala=@tatayat+k+at+a)a 
> (K+mtala> (K+ ma = a, 


that is, (map of q) > aq’. Similarly: (map of q’) > q. Since the mappings 
are inverse perspective mappings, equality must then hold in the preceding 
two relations and the theorem is completely proved. 


THEOREM 3.6. Suppose L is complemented and X&'-continuous for every ®’ < &. 
Suppose also that {cs;8 <Q} is an increasing family with xcg = 0 for every 
8B <Q for some fixed x with x < S-cg. Then there exists a residually independent 
family {ag;8 <Q} such that: 


(3.4) > a, = & for ever 8 < Q, 
<8 
(3.5) the residual element of {ag} is > x. 


Proof. By transfinite induction we shall define for each 8 < 2 a complement 
Cz of cg such that Cs > x and C, > C;, for all y < 8. 

We choose C, to be any complement of c, with C,; > x. Then for 8 > 1, 
by transfinite induction, we choose Cs to be a relative complement 
(TregCs — ce(L]sesCs)] with Cy > x. This is possible since, by the inductive 
assumption, T]esCs > x and xcg(L],esC;) < xcs = 0; this choice of Cs does 
give a complement of cg because 


(II &) +o- I] (Cs + @) = I] @) =1 


$< 


due to the assumption that L is lower X’-continuous for &’ < X&. 

Now choose a; = ¢;, and for 1 < 8 < Q, choose ag = ca(L1,<sCs). 

Then (3.4) holds; for by. transfinite induction on 8, it follows that for 
every 8: 


(Loot I] %) = TT (244%) = Tm =1: 


6-8 <8 
a=a(X cy + T] C:) = La +a(T] C:) = > a, + a = Dd a. 
¥< 6<f <A 6<f <8 <8 


Next, {as;8 <Q} is residually independent; for 

as(D (ay; y # B)) = apce(D (a,; y ¥ 8)) 
(2 (ay; 7 < 8B) + ce(D (ay; y > B))Cp) 
ag(> (a,;7 < 8)) = 0 


since L is upper X’-continuous for X’ < &. 
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Finally, for each y < Q, 
x = x(Dcs) < C,(QLas) = D (as; 8 > 7) 
so (3.5) holds. 


COROLLARY 1. Suppose L is a complemented, No-complete modular lattice. 
Then L is upper No-continuous if and only if every residually independent 
sequence is strongly independent. More generally, if L is a complemented X- 
complete modular lattice, and L is X'-continuous for all &' < &, then L is 
upper X-continuous if and only if every N-residual element is 0." 


COROLLARY 2. Suppose L is complemented and X&'-continuous for every 
N’ < &. If {a} is independent then there exists a strongly independent family 
{b,} such that a, = b, for each x. 


Proof. We may suppose the a, are well-ordered and indexed as {ag; 8 < Q}. 
Let cs = > (a,; y < 8). Then since {as} is independent and L is upper N’- 
continuous for every &’ < & it follows that for every 8 < Q, 


apd. (Cy; ¥ < B) = 0, ag + D(Gi ¥ < B) = &. 
Now apply Theorem 3.6 (with x = 0) to the increasing family {cg}; it 
follows that there exists a residually independent family {a,g’; 8 < Q} with 
Llay;7 <8) =cs for every 8 < Q. 


Clearly for every 8, ag ~ ag’ with axis }>(c,; y < 8). 

Now let the residual element of {ag’} be denoted as y and let Y be a com- 
plement of y. Then by Theorem 3.3 the elements bg = Y(a,’ + y) form a 
strongly independent family. 

Since bs ~ as’ (with axis y) for each 8, and as’ ~ ag (as shown above), 
therefore bs = ag and so the bg satisfy our requirements. 


4. Additivity of continuity. In this section and in §§ 5, 6, 7, we assume 
that L is a complemented, N-complete modular lattice. 


THEOREM 4.1. Suppose a, T a, 5, 7 b, and both ay, b, converge continuously. 
If a,b, = 0 for every X (equivalently, if ab = 0), then a, + by also converges 
continuously. 


Proof. We may suppose x(a, + ),) = 0 for every \ (which implies x(a,+,) 
= 0, (x + a)d, = 0 for all A, u) and need only prove that x(a + d) = 0. 
But the continuous convergence of b, yields for every A, (x + a,)b = 0, and 
so (x + b)a, = 0; continuous convergence of a, yields (x + b)a = 0, hence 


x(a + b) = x(a(x + 5b) + 6) = xb = } (xd) = O as required. 
THEOREM 4.2. If [a, 1] is upper NR-continuous and cf 1 in L then>. (ac) = a. 
Proof. 1. First consider the case that for every A, aq, = 0. We shall show 


“Jn the case NX = No no continuity assumption is implied. 
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that in this case (a, c,)P holds for each uw. Then since }-c, = 1, this implies 
(a, 1)P, hence a = 0, as required. 

To show (a, c,)P holds we let C, be an arbitrary complement of c, and we 
need only prove that C, > a (see 2.6)). But if A > u, 


oO = a(g.@ C,) = & + aC,. 
Hence 
&@ C, = 1 =Dan = Dale, + aC,) 
= ¢@ Lalac,) 


by the definition of lattice union. Since C, > }>,(aC,), the modular law implies 
that C, = >,(aC,). Hence a + C, = a+ D,(aC,) = Lala + GC,), by the 
definition of lattice union. Then 
(a+ C,)(a+¢) = (Lala + aC,)) (a + &) 

= Dal((a + aC,)(a + ¢,)) 
since [a, 1] is upper X-continuous, 

-_ dala + C,(a + oC,)) 

=<¢ 
because 

¢.(a + aC,) = ¢,(a(G + aC,) + aC,) < o.(aq + C,) = (0+ C,) = 0. 


Thus, in turn, 


(a+ Co = (a+ C,)(a+¢)c¢ = ac, = 0; 


(a+ C,)o.+ C, = C,; 
a+C, = C,; 
a <€ C, 


as required. 


2. In the general case, let’ ay = > a(aq). Then ao < a and (a) + q)a = ao 
for every \. Since (ao + &) T 1 in the lattice [ao, 1], we can apply the argument 
of the preceding paragraph with [ao, 1] in place of L. We obtain: a = ao, that 
is, >. (ac,) = a, as required. 


THEOREM 4.3. Additivity of upper XN-continuity. If both (0, a], [0,5] are 
upper X-continuous then (0, a + b] is also upper N-continuous. 


Proof. We may clearly suppose a@® 6 = 1, q& 71 and need only prove 
(xc,) T x for every x. But aq, bq, both converge continuously; hence, by 
Theorem 4.1, ac, + bc, converges continuously. 

By (2.1), [a, 1] is lattice isomorphic to [0,6] and hence is upper N-con- 
tinuous. Then, by Theorem 4.2, }\(aq) = a. Similarly 3° (bq) = b. So 
Dd (aq + bq) =~at+5=1. 
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But we have shown that ac, + bq converges continuously; so for every 
x, x > DL (xa) > Lx(aq + ba) = x. This shows that (xc) Tx and proves 
Theorem 4.3. 


THEOREM 4.4. (Generalization of Theorem 4.3.) Suppose L is upper 
N’-continuous for some X' < XN and [0,a,] is upper N-continuous for each 
uw € T with T < RX’. Then (0,>a,] is upper X-continuous. 


Proof. 1. We may suppose that &’ is infinite since Theorem 4.3 shows that 
Theorem 4.4 holds for finite X’. 

2. We shall prove Theorem 4.4 by transfinite induction on X’; we may 
therefore suppose that Theorem 4.4 holds for all cardinals less than the 
given infinite NX’. 

3. We may now suppose that the indices uw are arranged as the set of 
ordinal numbers 8 < Q,, where Q, is the least ordinal number of corresponding 
cardinal power WN’. 

4. Since [0, }> (as; 8 < y)] is upper N-continuous for every y <Q (by 
the inductive assumption), we may assume that {ag} is increasing, say ag f a. 

5. Thus we may suppose: 

(i) Foreach 8 < Q,, (0, as] is upper X-continuous and ag T a with continuous 
convergence (since L is assumed to be upper X’-continuous). 

And we need only prove that [0, a] is upper N-continuous. 

It is sufficient to prove: 

(ii) c, T a, xc, = 0 for all y < Q, for some 2, < Q together imply xa = 0. 

6. For each 8, (c,ag) T dg where dg = >-,(c,a5) < az. 

Clearly {as; 8 < Q,} is an increasing family, along with {ag}, and con- 
verges continuously since L is assumed to be upper N’-continuous. Hence, 
for every vy, 


Cy(Lisds) = Dp(cas). 


Now 


Leds = Ly8(Gas) = Ly (Diecyas) 
= >,(c,a) since ag T a, continuous convergence, 
a >) aby since c, < a for every vy, 
=a since c, Tf a, by hypothesis. 


Thus dg 7 a and the convergence is continuous. 
Next, for each 8, 


xdg = X24 (c,a) 

> 1(xc,ag) since [0, ag] is upper NX-continuous, 
=> ,(0) since xc, = 0 for very vy, 

= (0. 


ll 


This proves the theorem. 
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5. Homogeneous sequences. We assume, as in § 4, that L is a comple- 
mented X-complete, modular lattice but most of this section involves only 
the complemented countably complete modular lattices. 


Definition 5.1. A sequence {a,} is called homogeneous if {a,} is strongly 
independent and the a, are pairwise perspective. 


Definition 5.2. If {a,} is a sequence in L then for any complement A of 
Da, the sequence {a*, + A} is called a dual sequence of {a,}. 


Remark 1. Each dual sequence of {a,} is strongly independent in L’; if 
{a,} is strongly independent in Z then each of its dual sequences, considered 
in L’, has the original {a,} as a dual sequence. 


Remark 2. lf {a,} is homogeneous in L then each of its dual sequences is 
homogeneous in L’. 


Definition 5.3. A homogeneous sequence {a,} is said to be of type (A) if 
all the a, possess a common complement (equivalently, a common relative 
complement in }-a,), that is, there exists an element A such that a,@ A = 1 
for all n. 

A homogeneous sequence {a,} is said to be of type (A*) if all the a*, have 
a common complement (equivalently, a common relative complement in 


La,). 


Remark 1. Clearly if {a,} is strongly independent, then {a,} is homogeneous 
and of type (A), or (A*), if and only if one (hence all) of its dual sequences 
is homogeneous and of type (A*), or (A) respectively, in L’. 

Hence, if every homogeneous sequence in L is of type (A), or if every 
homogeneous sequence in L is of type (A*), then every homogeneous sequence 
in L’ is of type (A*) or (A), respectively. 


Remark 2. If {a,} is a homogeneous sequence and x; < a;, then any set 
of perspective mappings of [0, a,] onto [0,a,] when applied to x, will yield 
a homogeneous sequence {x,} (Theorem 5.1 below and its Corollary 1 will 
imply that if {a,} is of type (A), or (A*), then {x,} has the same property). 


Remark 3. lf {a,} is a homogeneous sequence then every infinite subsequence 
is also homogeneous; and if {a,} is of type (A), or (A*), then every infinite 
subsequence is of the same type. 

LemMaA 5.1. If {a@o, @1,...,@n,...} 18 @ homogeneous sequence of type (A) 
there exists at least one C-sequence {(do|C,,@,)} such that acd.c, = 0. 


Remark. From Theorem 5.1 below it will follow that under the hypothesis 
of Lemma 5.1 every C-sequence {(ao|c,,@,)} has the property aoc, = 0. 


Proof. Let A be a common complement of the a,. Choose c, = A (ado + a,) 
for m > 1. Then 


Aol, = Anl, = 0, Ao @ Cy = GoD Gn = Ch®D Ay. 
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The lemma follows since {ao, a:, . . .} is strongly independent (by hypothesis) 
and ao)_¢, < aA = 0. 


LEMMA 5.2. Suppose {ao, ai, ...} is a@ homogeneous sequence of type (A) and 
Ada, = 0. If x < A, (x,da,)P together imply x = 0 (in particular, if A is 
perspective to a subelement of >-a,), then (0, A] is upper No-continuous. 


Proof 1. By Corollary 1 to Theorem 3.6 we need only prove that every 
residually independent sequence in [0, A] has residual element zero. We may 
therefore suppose that x, (# 0) is the residual element of some residually 
independent sequence in [0, A] and we need only derive a contradiction. 

2. The hypotheses imply that (x;,}-a,)P is false; therefore (x:,a,)P is 
false for some m, hence (x, a0)P is false since a, ~ ao. Thus there exists 
x #0 with x < x; and x perspective to a subelement of ap. Theorem 3.3 
shows that x is the residual element of some residually independent sequence 
in [0, A]. 

By Remark 2 following Definition 5.3 we may suppose (by replacement 
of a, by suitable subelements) that x is perspective to ao, say by a perspective 
mapping ¢. 

3. By the Corollary to Theorem 3.2 there exists a residual C-sequence 
{ (x|cn, bn)} with oc, +>), < A; then x is the residual element of {c,} and 
x =) x, for suitable x, such that {(x,, c,, 5,)}C holds. 

4. By Lemma 5.1 there exists a C-sequence {(a/d,, a,)} with ac>_d, = 0. 
We shall derive a contradiction in the following way: we shall construct a 
C-sequence { (x,’, c,’, b,’)} with: 


(i) Xm = (xn) 
(ii) Cn < dn, On < ap. 


(i) will imply that >ox,’ = ¢(Sx,) = (x) = ao and (ii) will imply that 
(d/c,’) (2 x,’) = 0. Then the “extension of perspective mapping’? Theorem 
3.5 will apply and give an extension of ¢ (which we write again as ¢) such 
that ¢(c,) = c,’ for all m. This will yield: 


o(x Dic.) = o(x) DY o(c,) 
ao Dc, < ao Dd, = 0 
and imply that x = 0, the desired contradiction. 
5. The reader can verify easily that the elements x,’ =@(x,), Cn’ = (xn' +@n)dh, 


b,’ = (ao+c,’)a, form a C-sequence satisfying (i), (ii) above. Thus Lemma 5.2 
is proved. 


(x) 


LemMA 5.3. Suppose {x,a,;m > 1} and {x,b,;m >1} are homogeneous 
sequences with {x, >°an, >-b,} independent. Then S = {x, ai, bi, do, b2,...} is 
a homogeneous sequence. Moreover if both {x,a,;n > 1} and {x, b,;m > 1} are 
of type (A) then S is also of type (A). 
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Proof. S is strongly independent, by application of the Corollary to Theorem 
3.1. Moreover, all of x, a,(m > 1), 6, (m > 1) are pairwise perspective so S is 
a homogeneous sequence. 

Now suppose A and B are common relative complements of each of x, a, 
in x + dia, and of each of x, b, in x + b,, respectively. 

Then A + B is a common relative complement of each of x, a,, 5, in 
x + da, + D),; for if ¢ is x or a, then 

c(A+B)=cA=0 since B(x + da,)(x +>db,) = Bx = 0. 
Similarly c(A + B) = Oif cis b,. Itisclear thatc + A + B = x + Ya,+D6, 
if ¢ is x, ay, or b,. This proves Lemma 5.3. 

Now we shall prove: 


THEOREM 5.1. The following conditions are equivalent for a homogeneous 
sequence {a,}: 
(i) [0,>¢a,] is upper No-continuous. 
(ii) > 4_1@4 converges continuously. 


(iii) {a,} is of type (A). 


Proof. 1. (i) implies (ii): this is trivial. 

2. (ii) implies (iii): By Lemma 5.3 it is sufficient to prove that {a2,} is of 
type (A). 

Suppose (@2_,%,,d2n+2)C. Then {x,} is a homogeneous sequence with x, ~da,-1 
for all m>1 (use (3.1) and (2.2)). Since (}>x,) (So @an—1) < (So@en) (D-Gon-1) = 0, 
Theorem 3.4 shows that there exists a perspective mapping of [0, }>x,] onto 
[0, S-aen-1] which maps x, ON Go_-1. 

But >-7- 1@2;-1 converges continuously: to see this, observe that for every y, 


JE oie (VE om \(Ea)= £(E mE) 
£6 a) 


n=l 


Therefore, "%-1x, converges continuously. But a2,-%.1x, = 0 for every 
n, so x = >_x, satisfies a,x = 0 for every p. Obviously a2, + x = Sa» for 
every p so the a2, all have x as common relative complement in }(a2,. This 
proves that {a2,} is of type (A) and shows that (ii) implies (iii). 

3. (iii) implies (i): In Lemma 5.1 use A = }odo,. Since {@2,_1} is of type (A) 
and A>-da,-1 = 0, A ~ Sden-1, therefore Lemma 5.1 applies and shows that 
[0, $a2,] is upper No-continuous. Similarly [0, }-a2,_,] is upper No-continuous. 

Now by Theorem 4.3 (the additivity of continuity), [0,>0a,] is upper 
No-continuous. 


Coro.iary 1. The following are equivalent for a homogeneous sequence {a,}: 
(i) [0,>-a,] is lower No-continuous. 

(ii) }-Fen@, converges continuously. 

(iii) {a,} is of type (A*). 
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Proof. Apply Theorem 5.1 to {a*,} in the lattice dual to [0, }-a,]. 


CoROLLARY 2. Jf a homogeneous sequence {a,} is of type (A) and also of 
type (A*) then all a, = 0. 


Proof. Let A be a common relative complement of the a*,, in }-a,. Since 
AD ta 104 < Aa*n4i = 0, and 7. 1a; converges continuously by Theorem 5.1, 
therefore A = 0. Then a*,, = }-dn, dm < Gn@*m = 0, so all a,, are 0. 


Coro.iary 3. If {a,} is a homogeneous sequence of type (A) and (0, a,] is 
upper X-continuous for every n, then [0,>-a,] is also upper N-continuous. 


Proof. This follows from Theorem 4.4. 


Remark. Corollary 3 applies, in particular, if each a, is an atom. 


6. Additivity of finiteness. We assume, as in §§ 4, 5, that L is a 
complemented X-complete modular lattice. 


Definition 6.1. L is called finite if every independent sequence of pairwise 
perspective elements has all its elements zero.'? 


THEOREM 6.1. If {c,} is am independent sequence of pairwise perspective 
elements there exists a homogeneous sequence {d,} with d; = ¢,, dm ~ C, for all 
m,n and Dd, < Dien. 


Proof. By Theorem 3.6, applied to our }-'.:¢; with x = 0 (no continuity 
is required in the hypotheses for the case X = No), there exists a residually 
independent sequence {a,} with 


"The following possible definitions of “finiteness” for a modular lattice with zero: 
(F,): as in Definition 6.1, 
(F:):a = b, b < a, imply a = 5, 
(F;):a~c,c ~b,b < aimply a = } 
are related as follows: 
(i) (F 2) implies ( F;) always. 

(ii) (1) implies ( F:) if the lattice is also complemented. 

(iii) If the lattice is also complemented and countably complete then ( F;), (F2), and (F;) are 
all equivalent. 

(iv) If the lattice is not countably complete then ( F;) need not imply ( F;); this is shown by 
the example of footnote 14 where the lattice is even orthocomplemented and perspectivity is 
actually transitive. 

(i) is trivially true. 

To prove (ii): suppose there is a projective mapping ¢ of [0, a] onto (0, b) with b < a and 
b # a. The argument used to prove (2.7) actually shows that for some 0 # x; < [a — 6] we 
have x; ~ x2 where x2 = ¢(x;). Let x, = ¢ (Xn_1) for m > 1. Then repeated application of @ 
to the relation x; ~ x2 shows that {x,} is independent and pairwise perspective, so (F;) fails 
to hold. This proves (ii). 

To prove (iii): suppose {a,} pairwise perspective and independent. By Theorem 6.1, with 
the same a;, we may assume even strong independence. Then by Theorem 3.4, 

Z(aon—1; 2 > 1) ~ Laon; nm > 1), D(aonjm > 1) ~ Zaengi; m > 1). 
Now (F;) would force a; = 0, so ( F;) implies ( F;) and (iii) follows from the previous remarks. 

















COMPLEMENTED MODULAR LATTICES 503 


Sane > «, 


tml i=l 


for every » > 1. We observe that 


n—1 
a; = c, and for > 1, a, ~ ed ais D> a= > 
t=1 tml 
Now the a, are pairwise perspective; for if p > m then c,(a, + ¢,) = 0 so 
(2.2) implies a, ~ c,. If m # mn and p > m and p > n, then {am, dq, c} is 
independent, a@_ ~ Cp ™ Gn, SO by (2.2), adn ~ dy. 
If p <n then a,(a, +c¢,) = 0 so a,~a,, 4,~¢, yield by (2.2) that 
a, ™ Cy. Thus a, ~ ¢, for all n, p and a; = ¢;. 
Now let y be the residual element of {a,} and let Y bea relative complement 
of y in Da, = Dc, with Y > a; = c. Let d, = (y + ,)Y. 
Then Theorem 3.3 shows that {d,} is strongly independent. Now d, is the 
map of a, in a perspective mapping of 


lated wn tadal 


with axis y, for any m > n. Hence the d, are pairwise perspective, along with 
the a,. Thus {d,} is a homogeneous sequence. 
The definitions show that d; = a; = c; and for every n, 


y+ Ld=yt La=yt dL & 


If n> 1, then d,~c, with axis y +3 as*"'c, (use: yO wer"es = YD wer", = O 
and y> wei"d; < yY = 0). 

But then d,,~ c, for all m # n; for d, ~ d; = ¢; ~ C, and {dm, Ci, Ca} is 
independent, so (2.2) yields dy, ~ Cp. 

Since }-d, < Sc, obviously, Theorem 6.1 is proved. 


Coro.iary. If every homogeneous sequence has all its elements zero then the 
lattice is finite. : 


Lemma 6.1. Suppose L is &'-continuous for all &' < XN and suppose L can 
be doubled. If {ag; 8 < Q} is strongly independent and x < Sag but x>_ (a,; y<8) 
= 0 for all 8 < Q then x is a member of a homogeneous sequence. 


Proof. Let X = [(S-as) — x] and define dg = (ag +x)X. Then {dg} is 
obviously an independent family and ag ~ dg for each 8. Now Corollary 2 to 
Theorem 3.6, applied to [0, X], gives a strongly independent family {dg} 
with bg < X and dg = bg. 

Since {ag}, {bs} are both strongly independent and ag = bg for every 8, 
and >-bs < dag, therefore (ii) of the Corollary to Theorem 3.4, together 
with Theorem 6.1, show that [Sas — }0bs] is a member of a homogeneous 
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sequence. But the relative complement [}-ag — }- bs] could be chosen > x 
so (use Remark 2 following Definition 5.3) x itself is a member of a homo- 
geneous sequence, as stated. 


THEOREM 6.2. The following properties are equivalent: 

(i) every homogeneous sequence in L is of type (A), 

(ii) for every strongly independent sequence {a,} for which [0,>-a,} can be 
doubled, >> ;.:"a; converges continuously. 


Proof. 1. (ii) implies (i): Let {x,} be a homogeneous sequence. Then each 
of {xan}, {X2n-1} is strongly independent (in fact, a homogeneous sequence) 
and each of }°x2,, }>x2,-1 can be doubled (in fact, (Sx) (Soxo,.-1) = 0 and 
(Sox2n,) ~ (Sxn-1) by (i) of the Corollary to Theorem 3.4). 

Now if (ii) holds, then each of 


n n 
> X24, yD X2i-1 
i=] t=1 


converges continuously so by Theorem 5.1, each of [0,}>x2,], [0, ¥-x2,-:] is 
upper No-continuous; hence [0, }>x,] is upper No-continuous by Theorem 4.3. 
Finally {x,} is of type (A) by Theorem 5.1. So (ii) implies (i). 

2. (i) implies (ii). Suppose (i) holds. We may suppose that {a,} is strongly 
independent and that the lattice [0,5°a,] can be doubled and we need only 
prove that >> ;.:"a@; converges continuously. 

We may suppose that there exists an element x # 0 such that x < da, 
and x>. ;.:"a,; = 0 for all m and we need only derive a contradiction. 

3. By replacing each a, by a,(x + a*,) we may even suppose that a, is 
perspective to a subelement of x (observe: > a,(x + a*,) < oan, so 
[0, >-a,(x + a*,)]| can be doubled and has property (i) along with [0,}-a,]; 
also {a,(x + a*,)} is strongly independent, along with {a,}; finally, 
a,(x + a*,) ~ [x — xa*,] with axis a*,). 

4. We shall show now that [0,>¢a,] is upper No-continuous; this implies 
that x = 0 and gives the desired contradiction. 

5. In the present situation, Lemma 6.1 applies, with [0,>°a,] in place 
of L, and shows that there exists a homogeneous sequence {x,} in [0, >-a,] 
with x = x,. The validity of (i) in [0, }¢a,] then implies that {x,} is of type (A). 

6. Since [0,>°a,] can be doubled we may (and shall) assume that [0, }-a,] 
is identified with [0, v] in some modular lattice with zero, Li, in such a way 
that there exists a perspective mapping @ of [0,>¢a,] onto [0, u] for some 
u in L; with ua, = 0 (we do not know that L; is complemented and has 
property (i) but [0 ,>-a,], but so also (0, u], does have these properties). Then 
{@(x,)} is a homogeneous sequence of type (A) along with {x,}. 

Since a, is perspective to a subelement of x, and x is perspective to x, and 
(a, + x + x,)(x,) = 0, (2.2) and x, ~ ¢(x,) imply that a, is perspective 
to a subelement of ¢(x,). 

By (i) of the Corollary to Theorem 3.4, }-a, is perspective to a subelement 
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of '¢(x,). But [0,>°¢(x,)] is upper No-continuous by Theorem 5.1, hence 
(0, 3°a,] is also upper No-continuous. This completes the proof of Theorem 6.2. 


LemMA 6.2. If {c,} is a strongly independent sequence and a is an arbitrary 
element, there exists a decomposition Cc, = Cn’ ® C,'' with the properties: 

(i) ¢,' ~ d, for some strongly independent {d,} with Sd, < a(Socn), 

(ii) adc,” = 0. 


Proof. 1. Put 
Q= (a +> cn) , Gf = [oq — ci). 


m>n 


2. (ii) is immediate since 
o( cz) = o( cr(o o } # a2) ~ > a) 


m=—n m>n m>n 
=a) cn; 
m>n 


a(Sca) =aL(ciim>2)=...=aDL(ciim>n) 
= all, H(ci;m>n) <ILc =0. 


3. To prove (i) we note that 


o@® Le om<at De he 


m>n 


hence 


(6.1) 20> m=d0 Dd mm 


m>n m>n 


for suitable d, < a. 

Now (6.1) shows that d, ~ c,’ (axis }>m>nCm) and {d,} is strongly independent 
by (3.1). 

Since each dn <2c,, and <a, therefore )’d, < ajc, and Lemma 6.2 is 
proved. 


THEOREM 6.3. If in [0,a] and in [0,6] every homogeneous sequence is of 
type (A) then this is true in (0, a + 6). 


Proof. We may suppose a@ b = 1. By Theorem 6.2 we need only prove: 
if {c,} is strongly independent and [0,>}¢c,] can be doubled, then > ,2:"c, 
converges continuously. 

We shall use the decomposition of ¢,, ¢, = Cn’ @® c’’ and the d,, provided 
by Lemma 6.2 for the present a. We shall show: 

(i) > w"c,’ converges continuously, 

(ii) } 1"c,’ converges continuously. 

It will then follow from Theorem 4.1 that }°>:"c, converges continuously, 
proving the theorem. 
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To prove (ii): We note that (0, }°c,’’] is mapped by a perspective mapping 
(axis a) on a sublattice of [0, 5]. Since this sublattice can be doubled (it is 
lattice isomorphic to [0, }-c,”"] and }-c,”" < ¥c,), (ii) follows from the assumed 
properties of [0, 5]. 

To prove (i), we observe that (ii) of the Corollary to Theorem 3.4 applies 
to the lattice [0, }°c,] since {d,}, {c,’} are each strongly independent, d, ~ c,"’ 
for each n, d, < dom, Cn” < Licm and [0, }-c,] can be doubled. Thus [0, ¥c,’’] 
is lattice isomorphic to [0, >-d,]. 

Since [0,}>d,] can be doubled (along with [0, }c,’’], along with (0, ¥-c,]) 
and since }-d, < a, it follows from the hypothesis that > ,:"d,; converges 
continuously. Hence > ;.:"c,’ also converges continuously. 

This proves (i) and completes the proof of Theorem 6.3. 


Coro.iary 1. Jf in [0,a] and in [0,6] every homogeneous sequence is of 
type (A*) then this is also true in (0, a + 8). 


Proof. We may suppose a @ 6 = 1. Now Theorem 6.3 (for L’) implies: if 
in each of [a, 1]’, [b, 1]’ comsidered as sublattices of L’, every homogeneous 
sequence is of type (A), then this is true in [ab, 1]’, that is, [0, 1]’. 

But [a, 1]’, [b, 1]’ are anti-isomorphic to [0, 5], (0, a] respectively, by (2.1). 
Thus, if every homogeneous sequence in [0, a] or (0, 5] is of type (A*) then 
every homogeneous sequence in [b, 1]’ or [a, 1]’ is of type (A) (use the Remark 
1 following Definition 5.3); hence every homogeneous sequence in [0, 1]’ is 
of type (A); finally every homogeneous sequence in [0, 1] is of type (A*) 
(again using Remark 1 following Definition 5.3). 

This proves Corollary 1. 


COROLLARY 2. Additivity of finiteness. If each of (0, a], [0,5] is finite, so 
is (0, a + 5}. 


Proof. lf {a,} is a homogeneous sequence in [0, a + 5], then {a,} is of type 
(A), and also of type (A*) by Theorem 6.3 and its Corollary 1. Then, by 
Corollary 2 to Theorem 5.1, all a, are 0. 

Then, by the Corollary to Theorem 6.1, [0, a + 3] is finite. 


7. Unrestricted additivity of continuity in finite lattices. 
We assume that L is a complemented N-complete modular lattice. 


Lemma 7.1. Suppose L is upper X-continuous. Then for every family {ag; 8 < Q} 
there exists a strongly independent family { as} such that dg < ay and > dg = Dag. 


Proof. Put ds = [ag — as)-(a,; y < 8)]. Obviously {ds} is independent, 
ds < ag and by transfinite induction on y, >> (ds; 8 < vy) = ¥(as; 8 < y) for 
all y < Q. 

Strong independence of {ds} is equivalent to independence of {d,s} since L 
is upper X-continuous (see the last sentence preceding Theorem 3.1). 








— FD A we ee 
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Coro.iary. If x is an &-residual element and (0, x] is upper N-continuous, 
then x can be doubled in L. 


Proof. By the Corollary to Theorem 3.2 there exists a residual C-system 
{ (xg, Ce, bs)} with x = Dox. 

By Lemma 7.1, Zs = [xg — xs)_(x,; 7 < 8)] has the properties: {Z,} is 
strongly independent, %, < xg and x = }-Z,. 

Let bs = bg(Zs + cs). Then {6,} is strongly independent (along with {bg}), 
Zs, ~ bs for each 8 and (Zs) (bs) < x¥lbs = 0. Now by (i) of the Corollary 
to Theorem 3.4, x ~ by. Since xb, = 0 this proves that x can be doubled 
in L. 

LemMA 7.2. Suppose x < y, y® Y = 1 with [0, Y] upper X-continuous. If 


there exists an increasing family {cs} with xcg = 0 for every B and x < dcp, 
then there exists such an increasing family with > cs < y. 


Proof. Let cs’ = cs + [1 — Xic,]. Then docs’ = 1. Now yc’ has the properties 
specified: 


Lv) =v >x 
(observe that [y, 1] is lattice isomorphic to [0, Y] by (2.1), hence upper 
N-continuous, and use Theorem 4.2); 


x(ycp) = xcg = x(Dic,)cp = x(cg + 0) = O. 
COROLLARY. Suppose x is an N-residual element in L and L is &'-continuous 
for eery X&' <BR. If yO Y = 1 with x < y and (0, Y] upper N-continuous 
then x is an X-residual element in (0, y}. 


Proof. By hypothesis, x is the residual element of some residually indepen- 
dent family {ag; 8 < Q}. 

Define cs = > (a,; y < 8). Then xcg = 0 for each 8 since L is upper X’- 
continuous for all &’ < &. And x < ics since dics = Diag > x. 

Hence Lemma 7.2 shows that an increasing family {és} exists with }>és» < y, 
xés = 0 for every 8 and x < Sé,s. By Theorem 3.6, applied to (0, y], x < ¢ for 
some t which is an X-residual element in [0, y]; hence x itself has this property, 
by Theorem 3.3. 


THEOREM 7.1. Suppose x is an X-residual element in L with (0, x] upper 
XN-continuous. If L is &'-continuous for all XR’ < & then x is a member of a 
homogeneous sequence. 


Proof. 1. By Theorem 6.1 it is sufficient to show that x is a member of 
an independent sequence of pairwise perspective elements. 


2. It is sufficient therefore to prove that if {x, x:,..., Xm} is an independent 
family of pairwise perspective elements and m > 0, then there exists some 
Xm41 ™ x such that {x, x1,...,Xm4i} is independent. 


3. We choose Y = x1 +...+%x, and y = [1 — Y] with y > x. 








508 ICHIRO AMEMIYA AND ISRAEL HALPERIN 


Since each [0, x,] is upper N-continuous, along with [0, x], so is [0, Y], by 
Theorem 4.3. Hence, by the Corollary to Lemma 7.2, x is a residual element 
in [0,y). Now the Corollary to Lemma 7.1, applied to [0,y] shows that 
xX ™~ Xm+1 for some Xm4i1 < y with xx_41 = 0. Then 


Xm4i(X + XH... + Xm) = Xmgry(X + 1 +... + Xm) = Xmgi(x + 0) = 0 


so {X,*X1,...,Xm4i} is independent. 
This x4: satisfies our requirements and this completes the proof of Theorem 


7.1. 
Coro.iary 1. Jf L is finite and locally X-continuous then L is X-continuous. 


Proof. We prove this by transfinite induction. Hence we can suppose L is 
X’-continuous for all &’ < X&. 

By Corollary 7 to Theorem 3.6 it is sufficient to show that every X-residual 
element ¢ must be 0. 

But if ¢ # 0 then for some non-zero x with x < #, [0, x] is N-continuous 
(a fortiori, upper X-continuous). Then, by Theorem 3.3, x is also an X-residual 
element. Now Theorem 7.1 shows that x is a member of a homogeneous 
sequence. 

But L is finite, so x = 0. This gives a contradiction and shows that t ~ 0 
is impossible. Thus Corollary 1 must be valid. 


Remark. The proof of Theorem 7.1 shows that if L is finite and locally 
upper XN-continuous and X’-continuous for all X’ < &, then L is upper 
N-continuous. When X = XN this becomes: if L is finite and locally upper 
No-continuous then L is upper No-continuous. 


COROLLARY 2. If in L every homogeneous sequence is of type (A) and L is 
locally upper No-continuous then L is upper No-continuous. 

(Note: the Remark following Corollary 1 to Theorem 7.1 uses the stronger 
condition that L be finite.) 


Proof. Suppose, if possible, that L is not upper No-continuous. Then there 
exists some ¢ ~ 0 with ¢ an N>-residual element in L. 

By the hypotheses, there exists an x ~ 0 with x <¢ and [0,x] upper 
N.-continuous. By Theorem 3.3, x itself is also an No-residual element in L. 

Now Theorem 7.1 shows that x is a member of a homogeneous sequence 
{x, Yo, ¥1,.--}, by the hypotheses necessarily of type (A). 

Choose Y = Dy, and y = [1 — Y] with y>~-x. Then [0, Y] is upper 
No-continuous by Theorem 5.1 so x is an Np-residual element in [0, y], by 
the Corollary to Lemma 7.2. 

Now by the Corollary to Theorem 3.2 there exists a residual C-sequence 
{(x|cn, b,)} with x + >b, < y. Since 5, is perspective to a subelement of x 
and x is perspective to y, and (b, + x)y, < yY = 0 therefore (2.2) shows 
that 5, is perspective to a subelement of yp. 








tc 
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Since x ~ yo and (x + >),)(S-y,.) < y¥ = 0, therefore (i) of the Corollary 
to Theorem 3.4 shows that x + 5d, is perspective to a subelement of }-y,. 
But [0,>>y,] = (0, Y] is upper No-continuous so [0, x + °),] has the same 
property. Hence x = x)o¢, = Domes” (XD m1"Cx) = (0) = 0, a contradiction. 
Thus Corollary 2 must be vaild. 


8. Homogeneous sequences (continued). In this section we assume 
that L is a complemented countably complete modular lattice. 


LEMMA 8.1. Suppose {a,} is a homogeneous sequence with (d_, Xn, Gn+1)C for 
every n. Then 

(i) a;(3ox,) = 0 implies {a,} is of type (A); 

(ii) a; < ox, implies {a,} is of type (A*). 


Proof of (i): We shall show that }>x, is a complement of every a, in S-a,. 
Clearly a, + }ox; = Sa; so we need only prove a,>-x, = 0 for each n. But 
the axis x; +... + X,—-1 gives a perspective mapping of [0,a,] onto [0, a;] 
and by this mapping a,> x, is mapped on (a,)0x; + %1 +... + %-1)ai < 
(Sx,)a; = 0, so a,>-x; itself must be 0. 


Proof of (ii): We need only show that {x,} is a homogeneous sequence of 
type (A*). For the Corollary 1 to Theorem 5.1 will show that [0,>°x,] is 
lower No-continuous; then [0,>-a,] will also be lower No-continuous since 
the hypothesis implies that }>a, < }-x,; then {a,} will be of type (A*), again 
by Corollary 1 to Theorem 5.1. 

To show that {x,} is homogeneous of type (A*) it is sufficient (by Remark 
1 following Definition 5.3) to prove: 


(8.1) {x,} is strongly independent, 
(8.2) a; is a complement of every x*, in }>x;. 


Now (8.1) follows from (3.1). 
To prove (8.2) we verify:. 


ax, = ai( > «.)(= xi + > s.) 


t= t=—n+1 


n—1 
= a > x,+0=0; 
t=1 


atm=atmt...tmit Dd x 


t=—n+1 


=ait...tait D x 


t=—n+1 


so we need only show 


(8.3) an < > x4 for every n. 


t=—n+1 
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But (8.3) holds for » = 1, by hypothesis. Also, (8.3) holds for = 2 since 


a:< (= «.)(a +m) < (= «.)(¥ xi + x1) 


t= t=_ 


= (= x.) - (= 0. )s(a + a2) 


tm? 


= (= x.) +0 since x;d2 = 0, 


a: < > Xi 
t—2 
By repetition of this calculation, (8.3) can be proved for all n. This shows 
that (ii) holds and completes the proof of Lemma 8.1. 


THEOREM 8.1. (i) Jf {b,}, {c,} are homogeneous sequences of types (A) and 
(A*) respectively then >-b, and >-c, are completely disjoint. 

(ii) If {a,} is a homogeneous sequence there is a unique decomposition 
a, = b,@ c, such that {b,} and {c,} are homogeneous sequences of types (A), 
(A*) respectively. 


Proof of (i): Let b = >b,, c = Sic,. We may suppose (6, c)P false and we 
need only derive a contradiction. Clearly we may suppose (replacing },, C, 
by suitable subelements) 5; ~ c,; # 0. Then x < c, (x, 6)P together imply 
x = 0. 

Let d = [c — bc]. Lemma 5.2 (with our d in place of A in Lemma 5.2) 
shows that [0, d] is upper No-continuous. And [0, dc] is upper No-continuous 
since bc < b and [0,4] is upper No-continuous. Then Theorem 4.3 shows 
that [0,c] is upper No-continuous. 

Hence, by Theorem 5.1, {c,} is of type (A). Since {c,} is also of type (A*), 
all c, are 0 (by Corollary 2 to Theorem 5.1). This contradicts c, # 0 and (i) 
is therefore established. 


Proof of (ii): We need only obtain one decomposition as described, since 
uniqueness will follow from (i). 

Since {a,} is a homogeneous sequence, (@,, X,, @n41)C holds for certain x,. 
Put x = }-x, and let 

Cy = xX, bd; = [a, = a,x] 
and for n > 1, 
Cat+i1 = (Ca + Xn)On+1 Bn+1 = (b, + Xn) Qn+1- 
Thus, Cn+1, b241 obtain from ¢c,, 6, respectively by the perspective mapping 


of [0, a,] onto [0, a,4:] with axis x,. 
It follows that {d,}, {c,} are homogeneous sequences and a, = 5, @ ¢,. 








ce 


vs 





eg 
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Now let 


yn = Xn (dn + bn+1), 2, = Xn (Cn + Cn+1)- 


Then as the reader can verify easily, (0,, Yn, bn+1)C, (Cn, Sn» Cnai)C hold and 
x. = ¥n® Zn. 

Since bi>0y,. < b:a:.-x%, = 0, Lemma 8.1 (i) shows that {d,} is a homo- 
geneous sequence of type (A). 

Since 


C1(Lye + Liza) 
C1(Ln + (er + Dyn)Detn) = Cre 
(observe: (¢: + Soyn\-zn < (Sen) (b,) = 0), therefore Lemma 8.1 (ii) shows 


that {c,} is of type (A*). 
This completes the proof of Theorem 8.1. 


Cy = €;0\x 


Coro.uary. If L is complete then L is a direct sum L, + Lz + Ls where 
L, = [0, a,] with all a, in the centre of L, Li, is upper No-continuous, L, is 
lower XNo-continuous and Ly; is finite (a;, a2,a@3 are unique if Ls is maximal 
with the finiteness property). 


Proof. Let a; = }>x where x varies over all elements perspective to mem- 
bers of homogeneous sequences of type (A), let ag = }-y where y varies over 
all elements perspective to members of homogeneous sequences of type (A*). 
Then a, a2 are in the centre of L. 

Let a; = [1 — (a; + a@2)]. Then [0, as] is clearly finite. 

Now every homogeneous sequence in [0, a] is of type (A). For otherwise 
a, # 0 and some y # 0 with y < a; would be a member of a homogeneous 
sequence of type (A*), by Theorem 8.1 (ii). Also (y, x)P would be false for 
some x perspective to a member of a homogeneous sequence of type (A) 
hence for some x which is itself a member of a homogeneous sequence of 
type (A). But Theorem 8.1 (i) shows that (y,x)P holds in such circum- 
stances. This contradiction’ proves that every homogeneous sequence in 
[0, a,] is of type (A). 

Also [0, a;] is locally upper No-continuous. For if y # 0, y < a; holds, then 
as in the preceding paragraph (y, x)P is false for some x which is a member 
of a homogeneous sequence of type (A), so [0,x] is upper No-continuous 
(using Theorem 5.1). Then for some y; # 0, y: < y, the lattice (0, y,] is 
perspective to [0, x:] for some x; < x so [0, y:] is also upper No-continuous. 
This proves [0, a,] to be locally upper No-continuous. 

Then Corollary 2 to Theorem 7.1 shows that [0, a,] is upper No-continuous. 
Similarly, using the dual to Corollary 2 to Theorem 7.1, [0, a2] is lower No- 
continuous. 


Remark 1. If L is X-completé but not necessarily complete we can show 
that L is a sublattice of such a direct sum L; + Le + Li. 
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Remark 2. If L is complete and irreducible then ZL must be upper Xo- 
continuous, or lower No-continuous or finite.’* (See the Note added at end of 
this paper.) 


9. Kaplansky’s theorem. 


THEOREM 9.1. Suppose L is a complemented countably complete modular 
lattice.'* Then L is finite if it has the property: 

(9.1) for every a # 0 there exists an anti-automorphism ¢ of L such that (a, b)P 
is false for every complement b of $(a).'® 


Proof.'* By the Corollary to Theorem 6.1 it is sufficient to show that if 
{a,} is a homogeneous sequence in L then a; = 0. By Theorem 8.1 (ii) we 
may suppose that {a,} is of type (A) or (A*). 

Suppose if possible that a, # 0. Let ¢@ be an anti-automorphism of L (as 
provided by (9.1)) such that (a;, 6)P is false whenever 6@ ¢(a;) = 1. Then 
{@(a,)}, considered in L’, is homogeneous of type (A) or (A*). 

Hence every dual sequence {5,} of {¢(a,)} is homogeneous of type (A*) or 
(A) respectively, in L. Therefore (a;, 5:)P holds by Theorem 8.1 (i). 

But 5; is a complement of ¢(a;) by the definition of dual sequence, so this 
gives a contradiction to the property assumed for ¢. 

Thus a; * 0 is not possible and Theorem 9.1 is proved. 


Remark 1. (9.1) is obviously implied by the property: 

(9.2) for every a #0 there exists an anti-automorphism of L such that 
a < ¢(a) ts false. 

Hence, if L is a countably complete, orthocomplemented modular lattice, then L 
must be finite (see the Appendix for a direct proof of this result). 


Remark 2. lf L is a complemented, complete modular lattice, then (9.1) is 
implied by the property: 

(9.2)' for every z # 0 with z in the centre of L, there exists an anti-automorphism 
@ of L such that z < $(z) is false. 

To derive (9.1) from: (9.2)’ suppose a ~ 0 and let z be the least central 
element with z > a. Then there exists an anti-automorphism @ of L (as 
provided by (9.2)’) such that z < ¢(z) is false. 


[I rreducibility for a lattice L means: L = L; + L2 (direct sum) only if ZL; or L2 consists of 
one element. If L is complemented and modular, this is equivalent to: 0, 1 are the only elements 
in the centre of L (it was shown first by von Neumann (3, Part I, Theorems 5.2, 5.3) that for 
complemented modular lattices, irreducibility in the above sense is equivalent to: 0, 1 are the 
only elements with unique complements). 

“Theorem 9.1 (and aiso its Corollary) may fail to hold if L is not countably complete. This 
failure occurs in the orthocomplemented modular lattice consisting of all the linear subspaces 
of finite dimension and their orthogonal complements in Hilbert space. 

If (a, b)P is false for one complement b of ¢(a) then (a, b)P is necessarily false for every 
complement b of ¢(a). 

16See footnote 23. 
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Since z < 2; for every central element z; > a the anti-automorphism ¢ 
yields: ¢(z) > 2, for every central element z; < ¢(a). Since ¢(z) > z is false, 
therefore z < ¢(a@) must be false, also. Hence zb ~ 0 for some complement 6 
of ¢(a). 

Using (2.8) we have: (a, 5)P is false for at least one complement 6 of 
¢(a), hence for every complement 6 of ¢(a).'? Thus (9.1) has been derived 
from (9.2)’. 


CoROLLARY.'* A complete complemented modular lattice L is necessarily 
finite if it possesses an anti-automorphism o which is an orthocomplementation 
on the centre (that is, o(z)@® z= 1 for every central element z), in particular 
if L is irreducible’® and possesses at least one anti-automorphism. 


Définition 9.1. For a lattice L the property (SD). shall mean: 
(SD): For every increasing family {cs;8 <Q} there exists a strongly 
independent family {ag; 8 < Q} such that cg = > (a,; y < 8) for all B < Q. 


THEOREM 9.2. An orthocomplemented X-complete modular lattice has the 
property (SD), 


Proof. Suppose a — ¢(a) denotes the orthocomplementation. If {cs} is an 
increasing family, choose a, = ¢,, and for 1 < 8 < Q, ag = cal 1($(c,); y < B). 

Then {ag} is strongly independent; for y < 8 implies that a, is orthogonal 
to ag; hence a, is orthogonal to a*,, [la*, is orthogonal to every as, hence to 
Laz. Since Ila*, < Sas this implies [la*, = 0, so {ag} is indeed strongly 
independent. 

By transfinite induction on 8 it is easy to show that cs = }>(a,; y < 8) for 
all 8 < Q. 

This proves Theorem 9.2. 

THEOREM 9.3. Suppose L is a complemented X-complete modular lattice with 
the property (SZ) ». If L is finite and can be doubled then L is upper X-con- 
tinuous. 4 


Proof 1. We may suppose xcg = 0 for all 8 < Q, for some Q; < Q and 
cs | 1 and we need only prove x = 0. 

Let X be a complement of x and let cs’ = (x + cs)X. Then [0, cg] is mapped 
onto [0, cs’] by the perspective mapping with axis x. 

Since L is assumed to have the property (SI) there exist strongly inde- 


pendent families {as}, {as’} such that cg = } (a,; y < 8), cs’ = La’: 7 < 8B) 
for all B < Q,. 
Then ag ~ (x + as)X (with axis x). Since ag @ > (a,; y < 8) = cg, that is, 
ag @ > (cy; y < 8) = cg therefore (by the perspective mapping with axis x), 
17See footnote 15. 


18See footnote 14. 
19See footnote 13. 
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(x + ag)X OL(Gixv< 8) =e 


so (x + as)X ~ a,’ (with axis >} (a,’;7 < 8) = X(c’;7 < 8)). 

Therefore ag = ag’ for each 8B < 2), ag = ics = 1 > Lay’ and L can be 
doubled. 

If x #0, then Sia,’ < X <1 so [Sas — Yag’] + O and by (ii) of the 
Corollary to Theorem 3.4, [}-as — }-a,’] is a member of an independent 
sequence of pairwise perspective elements. This would contradict the assumed 
finiteness of L. 

Hence x = 0 as required and Theorem 9.3 is proved. 


THEOREM 9.4. If L is orthocomplemented X-complete and modular then L is 
X-continuous.*° 


Proof. L is finite by Remark 1 following Theorem 9.1. Thus, by Corollary 1 
to Theorem 7.1 it is sufficient to prove local X-continuity of L. Since L possesses 
an anti-automorphism it is sufficient to prove that L is locally upper X- 
continuous. 

We may suppose x is an element of L with x # 0 and we need only show 
that for some element y, with0O # y < x, the lattice [0, y] is upper X-continuous 

Now if there exists an element 0 # y < x such that [0, y] can be doubled 
then, since (0, y] has the property (SI) Theorem 9.3 shows that [0, y] is 
upper N-continuous. 

On the other hand, if 0 # y < x implies that [0, y] cannot be doubled 
then (y, z)P holds whenever y < x, z < x with yz = 0.7! Now if {ag} is an 
increasing family in [0,x] and yas = 0 for every 8, then (y, as)P holds for 
every 8, so (y, }-as)P holds, hence y}-as = 0. This proves that [0, x] is itself 
upper X-continuous. 

This completes the proof of Theorem 9.4. 


THEOREM 9.5. Suppose L is a complemented X-complete modular finite lattice 
which possesses an anti-automorphism of period two with the following con- 
tinuity property: $(xs)® xs = 1, xg | x together imply ¢(x) @ x = 1. Then L 
is X-continuous. 


Remark. Such ¢ generalize orthocomplementation. 


Proof 1. We prove this theorem by transfinite induction on X so we may 
suppose L is X’-continuous for all X’ < X&. 


*This is a strengthened form of Kaplansky’s theorem (1). In a letter to one of us dated 
June 13, 1957, Kaplansky conjectured that any complemented complete modular lattice is 
continuous if it possesses an anti-automorphism of period two which is an orthocomplementa- 
tion on the centre. Our Theorem 9.1 establishes finiteness under even weaker conditions but 
our Theorem 9.5 establishes continuity only under conditions somewhat more restrictive than 
those of Kaplansky’s conjecture. 

"In this case [0, x] is a complemented modular lattice in which every element has a unique 
complement, that is, a Boolean algebra. 
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2. It is sufficient to prove that L is locally upper N-continuous, for the 
anti-automorphic character of L will show then that L is locally lower X- 
continuous, hence locally N-continuous. Since L is assumed to be finite, 
Corollary 1 to Theorem 7.1 will then show that L is N-continuous. 

3. Thus we may suppose a # 0 and we need only prove that (0, d] is 
upper N-continuous for some 0 # b < a. We shall prove below: 

(i) If ¢(x) > x is false for every 0 # x < a then [0, a] possesses an ortho- 
complementation u—a@(u) (Theorem 9.4 then shows that [0, a] itself is 
XN-continuous). 

(ii) If ¢(x) > x holds for some 0 # x < a then (0, x] is upper N-continuous. 

4. To prove (i): We note that 

o(xo(x)) = o(x) + x > xo(x). 
Since x@(x) < x the assumption of (i) implies that x@(x) = 0 for all x < a. 
Then also x + ¢(x) = 1. Thus x + ad(x) = a, ad(ad(x)) = a(do(a) + x) 
= x + a¢(a) = x so x ~ a¢(x) is an orthocomplementation on [0, a]. This 
proves (i). 

5. To prove (ii): We may suppose ag T x(8 <Q), yas = 0 for all 8 and y < x; 
we need only prove y = 0. 


We can choose bg, by transfinite induction on 8 so that b; = [x — a,], and 
for B > 1, 


bs = [I1(b,; 7 < 8) — agl1(b,; 7 < 8)], 


and so that bg > y for all 8.22 Then ag@ bg = x for all 8 since (0, x] is lower 
N’-continuous for all X’ < &. 

Now let X be a complement of x. Set cs = X@(ag). Then cg 1, eax < Xx = 0 
and ¢s@ x = X¢(as) + x = o(as)(X + x) = o(ag) for all 8. Hence 


(bs ® cs) ® ag = $(az), (bs + cs)o(as) = ag, 
(bs + cp)b(bs + cs) = (bg + cp)b(as)o(bs + ce) = (bs + cp)as = 0, 
so - ’ 
(bs +-¢s)o(bs + cs) = 0 


for all 8. 
Suppose ¢(bs + cs) T d. Then do(d) = 0 and (bg + cs) | o(d). Since y < bp 
for every 8, therefore y < I1(bs + cs), that is, y < (d); but also 


o(d) < 11 (bs + cs) < [1¢(as) = o(Xas) = o(x) 


so d > x. Thus y < x¢(d) < d¢(d) = 0 as required. 
This proves that [0, x] is upper N-continuous and completes the proof of 
Theorem 9.5. 


Choose b, = y + [x — (y + a1), 
bp = y + [I(by; y < 8) — (y + as) (by; y < 8). 
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APPENDIX ON FINITENESS 


THEOREM.” In an orthocomplemented countably complete modular lattice every 
independent™ sequence of pairwise perspective elements must have all its elements 
zero. 


Proof. 1. We call an infinite sequence {x,; > 1} residually independent if 
Xnd_ (Xi; 4 # m) = O for all n, strongly independent if IL> (x;; i ~ n) = 0. Since, 
for every p, 


xpd (x134 ¥ p) <ILDX (x1; i ¥ n) 


strong independence implies residual independence. 

We note that if the x, are pairwise orthogonal then {x,} is strongly inde- 
pendent. Also, if {x,} is strongly independent and for every n, y, < >> (x;;i>n), 
Ynd_(Xi;4 > m) = 0 then {y,} is also strongly independent. If {x,} is strongly 
independent then so is {(x*,)4+}ox,} where x*, denotes }>(x,;i # m) and x4 
denotes the orthogonal complement of the element x. 

2. We may suppose {a,} is an independent sequence of pairwise perspective 
elements with a, # 0 and we need only derive a contradiction. By replacing 
a, for nm >1 by (a1 +... + 4@,)(@1 +... + Ga-1)+ we may even suppose 
that {a,} is strongly independent. 

3. By using suitable replacements for the a, we may even assume that 
they have a common relative complement A, that is, a, + A = a,,a,A = 0 
for all .*5 To see this, suppose a, ~ d,+4; (axis x,), that is, a, +x, = 
Ons + Xn = An + Anti, AnX_ = An4iX, = 0. Let x = }ox,. We must consider 
two cases: xa; ¥ a; and xa; = a). 


*This theorem (first proved by Kaplansky (1, Theorem 1), see footnote 25) is contained in 
our Theorem 9.1 (see Remark 1 following Theorem 9.1) but we give here a direct (lattice- 
theoretic) proof for this orthocomplemented case which can be read independently of the rest 
of this paper provided the reader has some slight familiarity with complemented modular 
lattices. 

With slight modification this direct proof actually establishes Theorem 9.1 in full generality. 

“A family {x,;\ € A} is called independent if for every finite subset FCA, 
Xp2(x,;4 € F) = O whenever u q F. 

**Kaplansky constructs a common relative complement A for every sequence {a,} of pairwise 
orthogonal and perspective elements (of course, the Theorem will show finally that all a, must 
be 0). 

Kaplansky'’s method is as follows: first, he replaces L by [0, 2 a,[. Then he shows that 
{ 2 (aun4i; m > 1);i = O, 1,2,3} isa homogeneous basis of order 4 in the sense of von Neumann 
(3, Part II, Definition 3.1). Therefore L can be identified with the lattice of principal right 
ideals of a suitable regular ring 9, by the coordinatization theorem of von Neumann (3, 
Part II, Theorem 14.1). 

Since L is orthocomplemented, there exists a conjugation operation in R: x — x* (that is, 
(x + y)* = x* + y*, (xy)* = y* x* and x** = x) such that every lattice element a in L is of 
the form (¢), with e a unique idempotent which is Hermitian (that is, e* = e) and then 
a+ = (1-e), (all proved in von Neumann (3, chapter Il, Theorem 4.5) ). 
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If xa; # ay, let a,’ be a relative complement of xa; in aj, let a,4,;’ be obtained 
from a,’ by the perspective mapping with axis x,, that is, @.4;’ = (a,' + X»)@n41 
and let x,’ = xn(@n’ + Gn41’), x’ = Dox,’. Then {a,’} is strongly independent, 
pairwise perspective and the a,’ have x as a common relative complement. 

If xa, = a;, then {x,} is strongly independent, pairwise perspective and 
the x*, = )0(x;;i # m) have a; as common relative complement. Hence 
{(x*,)+} is strongly independent, pairwise perspective and a;* is a common 
relative complement. 

4. Suppose now that our strongly independent sequence of pairwise per- 
spective elements is written as two sequences a,, },, and let A be their common 
relative complement. The strong independence of the set {all a,, all 5,} implies 
(Lan) (25,) = 0. 

Then a, ~ da41 with axis c, = A(a, + da41) and a;>-c, = 0. 

5. We shall show below that there exists a residually independent sequence 
of pairwise perspective elements {b,’} with },’ ~ b;, (56,’)(Sa,) = 0 and 
axes of perspectivity d, between 6,’ and },.;’ such that 6; < }d,. 

6. We will then derive a contradiction as follows: a; ~ b;, 6; ~ 6;’ with 
a;(b; + b;’) = 0 implies a, ~ 5,’. Let t; be an axis of perspectivity for a, by,’ 
and define ¢,,; for m > 1, by induction as follows: 


bai = (tn +a + d,) (dn + b;). 


Lett = >t. Thent + Sod, = t + Doc, and (use: for each n, {a,, 5 (a,,; m > n), 
b,, >. (bn; m > n)} is independent since {a,,} and {6,} are both residually 
independent) : 


>a, = (1, + 4 t.)(a, + . a.) = ta, + (= mos a.) 


t=_2 t=—2 


=0+ (x W(X a.) Pru (= a.) = 0. 


tm? t—2 


’ 


Thus the given a, must be of the form (en), with all ¢, idempotent, Hermitian and 


€n €m = O for n # m. 

Kaplansky now constructs elements in R, namely w, ex, e:: (for i > 1) such that: e.:; = e1; 
for all 4, ex = €1€10 Gi, Cx Cc. = Cy) and Gy, ex = G3; W = Cyw and we; = ey for all é (see 
Kaplansky (1, Lemma 21). 

Now (w)', the set of all u such that wu = 0, is a principal right ideal, as shown by von 
Neumann (3, Part II, Lemma 2.2). 


This (w)" is a common complement of the a,. For w (e:; — ¢:) = 0 for every 4; so for every j, 
(e;)» + (w)’ contains ¢; ¢;; + (ej: — €1)(— 1) = e:, hence it contains also, for every i, 
(i: — €1) + €1 = G41, hence also e;; ex, = e&. Thus 


(€;)e + (w)" = 1 for all j. 
Finally, if « is in both (e;), and (w)* then u = e;u and wu = 0; that is, weju = ¢,; u = 0, so 
u = €;, (€,; 4) = 0. This means the meet of (¢;), and (w)* is 0, and proves that (w)’ is a 
common complement of all a, = (é,)-. 
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bi = bided, = bi(t + Yc.) (t + a1) = Bi(t + (Xena) 
= bi(¢ +0) = dit = dit, = 0. 


This implies a, = 0 and gives the desired contradiction. 

7. Thus we need only to construct {d,’} as described in 5 above. Since 
Ab, is a common relative complement of the 5, we may suppose >}, = 1 
and write A again in place of A}-},,. Now let a denote A+ and let 6, denote 
(d*,)+. 


We shall prove that the family {a, b:, be, .. .} are pairwise perspective and 
independent, 6; < a +¥(6,;i > 1), and 6; ~ d). 

Indeed: 

* = 
(i) (iy =4+(D 5-440 heatnet 
men men 

since {b,} strongly independent implies that II,,.,5*,, = 5,, and so ab*, = 0. 
(ii) (a + b,)* = Ab, = 0,s0a + by = 1. 

Since {6,} is strongly independent, (i) shows that {a, 5,, 6:,...} is inde- 


pendent. Then (i) and (ii) show that a ~ 6, with axis 5*, so all of {a, 5,, be, . . .} 
are pairwise perspective, and 5, < a + 6*, = 1. Finally, 6; ~ 6, with axis b*). 

Thus the 8,’ will be available, as described in 5. if we prove the following 
“orthogonalization”’ lemma. 


LemMA.” Suppose { bo, fi, fo, ...} ts independent and pairwise perspective. Then 
there exists a sequence bo, b;, be, ... such that {b,;n >0O} is residually independent, 
and for n > 0, ba-1 ~ 6, with axis d,, so that fi +... +fm = di t...+ dm 
for every m (in particular if bb < >> (fm; m > 1) then bo < ¥ (dn; m > 1)). 


Proof. 1. Choose d; = f;. Then bo and f; have some axis of perspectivity u 
and we choose b; = (bo) + f;)u. Then ” 


bo @ dy = ,@ d; = bo ®@ Di. 
Hence we can choose By, a complement of bo, so that 


By = b; + {1 = (b; a d;)]}. Let B_, = |. 


**This ‘‘orthogonalization”’ lemma is proved here for every complemented countably complete 
modular lattice. Even countable completeness need not be assumed if residual independence 
of {x,} is defined to mean: for every n there exists an element X, such that x, X, = 0 and 
Xn > Xm for all n < m. 

*¥x @ y denotes the lattice union x + y but implies that the meet xy is 0. When x < y the 
symbol [x — y] denotes an arbitrary but fixed relative complement of y in x, that is, 
[ix -y@y ==. 
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2. We may suppose that for some r > 1, the following statements hold: 





ee = ae Se 
have all been defined so that: 
b,-1 ~ 6, with axis d, form = 1,...,7r; 
(W), | aq+...¢d,=fit...+f, fors=1,...,7; 
1 = bo ® Bo; Bo = 6, @ Bi;... ; Bre = 6-1 @ By-1; 
B,1 > 5, 


and we need only show how to define d,41, 5,41, B, so that (W),4: holds. (Ob- 
serve that (W), does hold for the d;, b;, Bo defined in 1 above.) 
3. Choose dj4; = Byii(fi +... + fr4s). 
Then 
(i) dit... + dai = (it... +d, + Brs)(fit... + fri) 
= (q,+...+4d,+6,+ Buaid(fit...+frsr) 
=(di+...d,+ 51+ B,-1) (fi +... + frst) 
ba +... dens + Bodhi t... + Serr) 
=...= Bi(fit...t+fear) =f... thea 


(ii) d+41(bo + by +...¢ b,) —_ d,41b, 
—_ dra (fi + see + fr41) br (bo + fi + see + f,) 
= dy416-(fi — eee + f,) = 416, (d a eee aa d,) = 0, 
drailfi + eee + f;) < d,41(do + by + +e + b,) = 0, 
80 dra ™ frags (axis fi +... +f, =dit...+d,). 
But ie ~ bo and bof rs + d++1) < bo( fi TF oe. + firs) = 0 so dr41 ~™ bo. 
Now {bo, b:,...,6,} is independent, b.1:.~ 5, for m<r, and 
d+41(bo + see + b,) = 0, so dy41 ~~ b,. 
4. Since d,,; ~ 6,, there exists an axis u such that 
- b,@ u = dy4i@® u. 
We choose 6,4; = (b,@ d,i1)u. Then we have 


5,® dy41 = 6,1@ dr41 = b,® bras 


so b, ~ b,41 with axis d,+1. 
5. Since b, + d,41 < B,-1 we can choose 


B, = D414 + [B,-1 — (6, + d,+1)]. 


Then B, > b,4:, and B,_; = 6,@ B,. 
Thus (W),4+: is satisfied and so the Lemma is proved and hence the Theorem 
is proved. 


Note added in proof. A recent paper by Ornstein (Dual vector spaces, 
Ann. Math., 69 (1959), 520-34) obtains the following result (his Corollary 5.1): 
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Suppose L is a complete, atomic, centreless, complemented, modular lattice in 
which 1 is the union of a countable number of atoms and O is the intersection of a 
countable number of co-atoms; then L is either isomorphic or anti-tsomorphic to 
the lattice of all subspaces of a vector space of countable dimension. 


Ornstein’s result can be deduced also from Remark 2 at the end of our § 8. 
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SUR UN THEOREME DE G. SZEKERES, CONCERNANT 
LES FONCTIONS MONOTONES ET CONVEXES 


S. MARCUS 


1. Dans une note récente (1), Szekeres a donné le théoréme suivant: 


Si f est une fonction réelle, définie, strictement croissante et ayant la deuxidme 
dérivée continue sur (a, b) (— ~ € a <b < @&), alors il existe une fonction 
réelle , définie et strictement croissante sur (a, b), et une fonction réelle py, définie 
et strictement croissante sur (¢(a), o(b)), telles que o''(x) > 0 pour a < x < b, 
v'"(u) < 0 pour o(a) < u < o(d) et f(x) = W(o(x)) pour a <x < b. 


Dans la démonstration de ce théoréme, Szekeres utilise une expression od 
la dérivée f’(x) apparaft au dénominateur, ce qui exige qu’on ait f’(x) # 0 
pour a < x < b, condition qui ne figure pas dans I|’énoncé du théoréme. I! 
est naturel alors de poser le probléme si le théoréme ci-dessus reste en vigueur 
si cette condition n’est pas remplie. Il y a lieu aussi de se demander si les 
autres conditions, par exemple l’existence et la continuité de f(x) sont néces- 
saires pour qu'une fonction strictement croissante puisse @tre représentée 
comme une superposition de deux fonctions croissantes, |’une convexe, l'autre 
concave. Dans ce qui suit nous allons donner une réponse a ces questions. 
Nous allons voir non seulement que le théoréme cesse d’é@tre vrai si f’(x) 
s’annule (méme en un seul point de (a, b)), mais qu’on a aussi d’autres con- 
ditions nécessaires, de nature assez restrictive, pour que f admette une représen- 
tation du type que nous venons de parler. D’autre part, nous allons établir 
des conditions suffisantes assez voisines de celles nécessaires, de sorte qu’une 
amélioration de ces conditions ne parait pas possible sans compliquer les 
hypothéses—hors mis le cas od les conditions nécessaires établies par le théo- 
réme 1 sont aussi suffisantes, question que nous ne savons pas résoudre. 


2. Une fonction réelle g est cémvexe (concave) sur (a, b) si poura <x <y <b 
ona 





feg2) <a (42), ape). 


Considérons une fonction réelle f, définie sur (a, b) (— ~- ¢€a<b< @&), 
Supposons qu’il existe une fonction réelle ¢, définie, convexe et strictement 
croissante sur (a, 6) et une fonction réelle y, définie, concave et strictement 
croissante sur (¢(a), (b)), telles que 


f(x) = ¥(¢(x)) poura < x <b. 
Nous dirons en ce cas que f est une fonction du type K. 
Regu le 25 novembre, 1958. 
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Si pour une fonction réelle f il existe, en un point x, la dérivée A droite 
(gauche) finie, nous désignons cette dérivée par f+(x)(f-(x)). 


THEOREME 1. Si une fonction réelle f, définie sur (a, b), est du type K sur 
(a, b), alors les fonctions f(x) et f-(x) sont définies sur (a, b). Chacune de ces 
deux fonctions est dérivable presque partout sur (a, b) et la dérivée est som- 
mable sur tout intervalle compact contenu dans (a, 6). Poura <x < bona 
f(x) > 0 < fr(x). 


Démonstration. Les fonctions ¢ et y, étant monotones, admettent des 
points de continuité donc, en vertu de la convexité de ¢ et de la concavité de 
¥, @ et ¥ sont continues en chaque point de (a, b) resp. (¢(a), (6)). D’aprés 
un théoréme classique, ¢~(x) et @*(x) existent en chaque point de (a, 5), 
¥(u) et ¥*(u) existent en chaque point de (¢(a), ¢(b)). On a donc, en vertu 
de la monotonie stricte de ¢, 


f(x) = W((%)) . &(%), FAC) = W*(O()) . OF (x) Sia Kw < . 


En vertu de la convexité de ¢, ¢~, et ¢+ sont des fonctions croissantes sur 
(a, b). En vertu de la concavité de y, y~ et y* sont des fonctions décroissantes 
sur (¢(a), o(b)), donc ¥~(¢(x)) et ¥*+(o(x)) sont décroissantes sur (a, 6). Par 
suite, chacune des fonctions f~ et f* est un produit de deux fonctions mono- 
tones, donc de deux fonctions dérivables presque partout. II s’ensuit que f- 
et ft sont dérivables presque partout sur (a, db). 

Soit maintenant (a, 8) un intervalle compact contenu dans (a, 5). ¥~(x), 
¥~(o(x)), o*(x) et ¥*(¢(x)) sont monotones et bornées sur (a, 8), donc f- 
et f+ sont a variation bornée sur (a, 8). Mais d’aprés un théoréme bien connu, 
telles fonctions admettent des dérivées sommables sur (a, 8). 

f, comme superposition de fonctions strictement croissantes, est stricte- 
ment croissante sur (a, b) donc, en vertu des théorémes connus (par exemple 
celui de Scheefer (2, p. 138)) f+ ne peut s’annuler identiquement en aucun 
intervalle. Il s’ensuit que ¢*(x) et ¥*(¢(x)) ne peuvent s’annuler identique- 
ment en aucun intervalle contenu dans (a, b) donc ¢*, qui d’aprés le raison- 
nement ci-dessus est croissante sur (a, b), est méme strictement croissante sur 
(a, 6). On constate aussi que ¥*(¢(x)) est strictement décroissante. D’autre 
part, en vertu de la monotonie croissante des fonctions @ et ¥, on a $*(x) 
> 0 < ¥*(¢(x)) pour a < x < Bb et, en vertu de la monotonie stricte de 
+(x) et ¥*(o(x)), que nous venons d’établir, on conclut: ¢*+(x) > 0 < ¥*(¢(x)) 
poura <x <b. 


D’une maniére analogue on prouve que ¢~(x) > 0 < ¥~-(¢(x)) poura <x <b 
et le théoréme 1 est complétement démontré. 

Un exemple d’une fonction f qui satisfait toutes les hypothéses du théoréme 
de Szekeres mais ne remplit pas la condition f-(x) # 0 # f+(x) est donné par 
f(x) = x*, considérée sur n’importe quel intervalle ouvert contenant I’origine. 
En effet, on a f’(0) = 0. Donc, en vertu du théoréme 1, f(x) = x* n’est du 
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type KX sur aucun intervalle (a, 5) tel que a < 0 < b, c’est-A-dire cette fonc- 
tion ne vérifie pas la thése du théoréme de Szekeres. 


3. Maintenant se pose le probléme: les conditions nécessaires données par 
le théoréme 1 sont-elles aussi suffisantes pour que la fonction f soit du type K? 
Nous ne savons pas. Mais nous allons montrer qu'il suffit d’ajouter, aux con- 
ditions nécessaires établies par le théoréme 1, la condition que f+ soit dérivable 
non seulement presque partout, mais partout sur (a, 5), pour que f soit du 
type K. 

LEMME. Si g est une fonction réelle croissante et absolument continue sur 
(a, b) et si @ est une fonction réelle absolument continue sur (g(a), g(b)), alors la 
fonction \ donnée par (x) = o(g(x)) est absolument continue sur (a, b). 


Nous laissons au lecteur la démonstration de ce lemme. 

Rappelons qu'une fonction f est absolument continue au sens généralisé sur 
(a, b), si elle est continue sur (a, }) et s'il existe une décomposition (a, d) 
= o’F,, ot E, sont des ensembles tels que f soit absolument continue sur 
E,, quel que soit m (3, p. 223). 


THEOREME 2. Soit f une fonction réelle dénbie sur (a,b) (— ~= Ca<b< @). 
Supposons que f est dérivable sur (a, b) et que f'(x) > 0 pour a < x < b. Sup- 
posons encore que f''(x) existe, finie, en chaque point de (a, b) et que f"’ est som- 
mable sur chaque intervalle compact contenu dans (a, b). Alors, f est, sur (a, 6), 
une fonction du type K. 


Remarque. Pour comprendre la situation du théoréme 2 par rapport au 
théoréme 1 il faut rappeler le théoréme de Dini, d’aprés lequel, si f* est con- 
tinue en x, alors f’(x) existe et est finie (voir, par exemple, (3, p. 204)). 


Démonstration du théoreme 2. On va utiliser l’idée de la démonstration du 
théoréme 1 de (1). 
La fonction 
"(= ve) si f"(x) > 0 
vor Osi f"’(x) <0 
est sommable sur chaque intervalle compact contenu dans (a, 6). Par hypo- 
thése, f’(x) est continue et strictement positive sur (a, 5). Il existe donc pour 
chaque intervalle compact J C (a, 6) un nombre p(/J) > 0 tel que 


1 
—— < f’ < p(J 
p(J) f (x) p( ) 
pour x € J. Il s’ensuit donc que I’intégrale 





_ ("fe@) 
p(y) = le f' (x) dx 


od d est un point fixe de (a, b), existe, au sens de Lebesgue, pour chaque y 
tel quea < y < Bb. 
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La fonction donnée par p(y) est continue, strictement croissante et stricte- 
ment positive sur (a, 5). Il s’ensuit donc que la fonction 


$(x) = f ody 


ot l’intégrale est prise au sens de Cauchy, est strictement croissante et con- 
vexe sur (a, 5). 

Désignons par y |’inverse de la fonction ¢. y est une fonction définie sur 
(¢(a), o(6)), strictement croissante et concave sur (¢(a), $(b)) et ayant les 
valeurs dans (a, 5). Le théoréme 2 sera démontré dés qu’on aura prouvé que 
la fonction donnée par 


h(u) = f(¥(u)) 


est concave sur (¢(a), ¢(b)), car on a en tout cas, en vertu de ce que y et ¢ 
sont des fonctions inverses, 
h(o(x)) = f(¥(o(x))) = f(x). 

Mais ¢ est une fonction dérivable, 4 dérivée strictement positive sur (a, ), 
donc y est dérivable sur (¢(a), #(b)). Il s’ensuit que # est une fonction dérivable 
sur (¢(a), ¢(6)). Pour montrer que h est concave, il suffit de montrer que 
h’(u) est décroissante sur (¢(a), $(b)). Le théoréme sera donc démontré si 
nous pourrons montrer que 

1° h’(u) est absolument continue sur tout intervalle compact contenu dans 
(¢(a), o(d)); 

2° h’’(u) < 0 presque partout sur (¢(a), $(d)). 

En effet, dans ce cas h’(u) sera absolument continue au sens généralisé 
sur (¢(a), ¢(b)). Mais on sait d’aprés un théoréme classique (voir, par exemple, 
(3, p. 223)) qu’une fonction absolument continue au sens généralisé est 
décroissante dés que sa dérivée approximative est presque partout < 0. 

Démontrons donc, 1° et 2°. Remarquons d’abord que la dérivée de ¢@ est 
une fonction absolument continue sur tout intervalle compact J contenu dans 
(a, 6). En effet, on a pour chaque x € (a, b), ¢’(x) = e?, od p(x) est crois- 
sante et, étant une intégrale de Lebesgue, est absolument continue sur J. 

Remarquons ensuite que la borne inférieure de ¢’(x) sur J est strictement 
positive. D’autre part, y étant continue et concave, est absolument continue 
sur ¢(/J). En tenant compte que y est croissante, on déduit, en vertu du 
lemme, que ¢’(¥(u)) est absolument continue et strictement positive sur ¢(/). 
Il s’ensuit donc que 

1 
, 
Vv) = Sum) 
est aussi absolument continue sur ¢(/J), qui peut étre considéré comme un 
intervalle compact quelconque contenu dans (¢(a), $(b)). f’(x), ayant une 
dérivée finie sur J, est absolument continue au sens généralisé sur J. La fonction 


F(x) = frau 
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od a est l’extrémité gauche de J, est absolument continue sur J. Mais les 
dérivées des fonctions f’(x) et F(x) coincident presque partout sur J donc, en 
vertu d’un théoréme connu (3, p. 225) ces fonctions différent par une con- 
stante. I] s’ensuit que f’(x) est absolument continue sur J. 

y, étant continue et concave, est absolument continue sur tout intervalle 
compact contenu dans (¢(a), ¢(b)). En vertu de ce que y est croissante, on 
peut faire usage du lemme et conclure que f’(y(u)) est absolument continue 
sur ¢(J). De ce que A’(u) = f’(W(u)) . ¥/(u) on déduit alors que h’ est absolu- 
ment continue sur (J). 

Démontrons maintenant que h’’(u) < 0 presque partout sur (¢(a), $(d)). 
L’existence, presque partout, de h’’(u), est assurée par le fait que h’(u), étant 
absolument continue, est 4 variation bornée sur tout intervalle compact 
contenu dans (¢(a), ¢()). On doit montrer que 


h”(u) = f"(y(u)) . Y*(u) + f’'(u)) . W’(u) < 0 
presque partout sur (¢(a), ¢(6)). Mais on a 
£ *(¥(u)) 
oe" (¥(u)) * 


En posant x = ¥(u) et en remarquant que chacune des fonctions ¢ et y, 
en vertu de leur continuité absolue, transforme chaque ensemble de mesure 
nulle dans un ensemble de mesure nulle (3, p. 225) l’inégalité ci-dessus devient 


(x) or) 
7 0 
te - fi ) ,< 
presque partout sur (a, 6), donc 
f(x) - o" (x) 
f'(x) ~*~ (x) 
presque partout sur (a, 5). Mais cette gee est évidente, car on a, presque 
partout sur (a, 5), oe. 
‘ '(x) _ fe(%) 
o'(x) — f’(x) 
et, d’autre part, pour chaque x € (a, d), 
f¥(x) ~ f''(x) 
fie) ~ fi) - 


Le théoréme 2 est ainsi complétement démontré. 


W's) = - 











Remarque. L’hypothése que f’’(x) existe, finie, non seulement presque 
partout, mais partout sur (a, 5), a été posée seulement pour assurer la con- 
tinuité absolue de f’(x) sur chaque intervalle compact contenu dans (a, d). 
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SOME RESULTS ON UNIQUENESS AND SUCCESSIVE 
APPROXIMATIONS 


FRED BRAUER 


1. In the theory of ordinary differential equations, there is a strange relation- 
ship between uniqueness of solutions and convergence of the successive approxi 
mations. There are examples of differential equations with unique solutions 
for which the successive approximations do not converge (8) and of differential 
equations with non-unique solutions for which the successive approximations 
do converge (2). However, in spite of the known logical independence of 
these two properties, almost all conditions which assure uniqueness also 
imply the convergence of the successive approximations. For example, the 
hypotheses of Kamke’s general uniqueness theorem (5), have been shown 
by Coddington and Levinson to suffice for the convergence of successive 
approximations, after the addition of one simple monotonicity condition 
(4). There is one counterexample to this “principle,” a generalization of 
Kamke’s result, to which another condition in addition to a monotonicity 
assumption must be added before convergence of the successive approximations 
can be proved (2). 

In this paper, we shall prove a pair of theorems, generalizing the results 
of Kamke, and Coddington and Levinson, and conforming to the “principle” 
mentioned above. The generalization here is in a different direction from that 
given in (2), but we shall alsoindicate how the results obtained here can be 
formulate and proved in the more general setting used there. 

The statements of our results involve a pair of conditions, each relating 
to a first order differential equation. We assume the existence of two functions, 
controlling, in a sense, the behaviour of the solutions of each one of these 
equations, and a relation between fhe growths of these functions near the 
origin. The complexity of the, results is compensated for by the large number 
of special cases which can be obtained by appropriate choice of these 
functions. 


2. We consider the initial value problem 
(1) x’ = f(t, x), x(0) = 0, 


where x and f are n-dimensional vectors. As usual, the norm |x! of a vector x 
will denote the sum of the absolute values of the components of x (3). Let 
v(t, r) (¢ = 1, 2) be continuous non-negative functions defined for 0 < ¢ < a, 
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r > 0, which are monotone non-decreasing in r for each fixed t. We will always 
assume that f satisfies the pair of conditions 

(2) lf (¢, X;) _ f(t, X:)| < ¥.(t, IX, —_ x.|), (4 = a 2), 
for (¢, X:) and (¢, X2) in a region 0 < t < a, |x| < b. 


THEOREM 1. Let f(t, x) be continuous and satisfy (2) in a region 0 <t <a, 
|x| < b. Suppose A(t) and B(t) are functions on 0 < t < a, with A(O) = B(O) 
= 0, such that 


(3) lim A (t)/B(t) = 0. 
140 

Suppose also that all solutions u(t) of 

(4) u’ = Wilt, u) 


with u(0) = 0 obey u(t) < A(t) on 0 < t <a, and that the only solution v(t) 
of 


(5) v’ = y(t, v) 
on 0 < t <a such that 


(6) lim v(t) /B(t) = 0 
40 
is the trivial solution. Then there is at most one solution of (1) on 0 <t <a. 


Proof. Suppose there are two solutions x,(¢) and x2(#) of (1) on 0 <t <a, 
and let m(t) = |x,(¢t) — x2(t)|. Then 


(7) |m’(t)| < |x:'@ — x2’ < |£(¢, x1) — FC, x2()| < vil, m(), 
using (2). Suppose there exists ¢, 0 < o < a, such that 
(8) m(c) > A(c). 


Then there is a solution u(t) of (4) passing through the point (¢, m(c)) and 
existing on some interval to the left of ¢. As far to the left of o as u(t) exists, 
it satisfies 


(9) u(t) < m(t). 
To prove (9), we observe that 
(10) u’ = W(t, u) + €, u(c) = m(o), 


has solutions u(t, e) for all sufficiently small « > 0, existing as far to the left 
of o as u(t) exists, and lim,,o,u(t, «) = u(t) (5, p. 83). Thus it suffices to 
prove 


(11) u(t, «€) < m(t), 


for all « > 0 and all solutions u(t, «) of (10). If this inequality does not hold, 


-_- 
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there is a least upper bound ¢ of numbers ¢t < ¢ for which (11) is false. Since 
m(c) = u(c) = u(oc, €) and the functions m(t) and u(t, e) are continuous, 


(12) m(f) = u(f, e), m'(t) > u’(f, €). 
Then 


vilS, m(S)) + = Wil, wf, €)) +e = w'(S, €-) < m'(S) < Wilf, md), 


using (7), (10), and (12). This contradiction proves (11), which, as we have 
remarked, implies (9). 

The solution u(t) can be continued to ¢ = 0. If u(c) = 0 for some c,0 < 
c¢ <, we can effect the continuation by defining u(t) = 0 for 0 < t < ¢; 
otherwise (9) ensures the possibility of the continuation. Since m(0) = 0, 
lim,,0u(t) = 0, and we define u(0) = 0. Now we have a solution u(t) of (4) 
with «u(0) = 0, and by hypothesis u(t) < A(t). However, in view of (8) 
and the definition of u(t), we have u(¢) > A(c), a contradiction which proves 


(13) m(t) < A(t), 0<t<a. 


To prove the uniqueness of solutions, we must show that m(t) vanishes 
identically on 0 < ¢ < a. To complete the proof, we must show that this is 
implied by (13). Proceeding as before but using ¥2 in place of ¥:, we obtain 
|m’(t)| < W2(t, m(t)). The assumption that m(r) > 0 for some 7,0 <r <a, 
yields, by the same argument as before, a solution v(t) of (5) onO0 Cc t<r 
such that v(r) = m(r), 0 < v(t) < m(t), v(0) = 0. Then 


0< lim v(t)/B(t) < lim m(t)/B(t) < lim A(t)/B(t) = 0, 


using (3) and (i3). But by hypothesis this implies that v(t) is identically zero, 
which contradicts v(r) = m(r) > 0, and therefore m(t) vanishes identically 
on 0 < t < a, which completes the proof of the theorem. 


3. The successive approximations to the solution of (1) are defined by 


(14) Xo(t) = 0, Xj4:(t) = Jt. x,(s))ds, (j= 0,1,...). 


THEOREM 2. Let f(t, x) be continuous in a region 0 <t <a, |x| < 6 and 
bounded in norm by M in this region. If the hypotheses of Theorem | are satisfied 
then the successive approximations (14) converge uniformly on the interval 
0 <t <a, where a = min (a, b/M), to the unique solution of (1). 


Proof. It follows easily from the definition (14) of the successive approxi- 
mations that they satisfy |x,(t:) — x,(te)| < Mt: — te| in the interval 0 < t 
< a. This implies that the sequence {x,(#) } is equicontinuous on this interval. 
Taking t, = 0, we have |x,(t:)| < Mt, < 6, and thus the sequence is also 
uniformly bounded. Therefore there exists a subsequence {Xj«(t)} which 
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converges uniformly to a function x(#) on 0 < ¢ < a. Because of (14) and the 
continuity of f, the sequence {X,<,:(¢)} converges uniformly to a function 


x*(¢) = ft. x(s))ds. 


We will prove that x,,;(¢) — x,(t) ~0 as j— @. This will imply, because of 
(14), that x(¢) = x*(t), so that x(t) is a solution of (1). Since the solution of 
(1) is unique, every convergent subsequence of {x,(¢) } converges to a solution, 
hence to the same solution, which shows that the original sequence converges 
to x(t) on 0 < t < a. Because of the equicontinuity of the sequence, this 
convergence is uniform. 


We define 


w,(t) ™ X541(t) —x,(t), m(t) = lim sup|w,(¢)|. 


Then m(0) = 0, and m/(t) is continuous on 0 < ¢ < a, since it is the upper 
limit of a uniformly bounded equicontinuous sequence of functions. We must 


show that m(t) vanishes identically on 0 < ¢ < a. Using (14) and (2), we 
obtain 


(15) |W y41(t-+h) —Wys1(t)| <f lf (s, X441(s)) —f(s, x,(s)) |ds 


t+h 
< J ¥i(s, |w,(s)|)ds. 


Because of the continuity of m and the equicontinuity of {w,}, given any 
e > 0, there exists an integer N(«), independent of s and j, such that 


(16) lw,(s)| <m(s)+¢, j> N(e). 
It follows from (15) and (16) that 
(17) [Wy l-+h)—Wyilt)| < fone, m(s)+e)ds, j > N(e). 
It follows easily from the definition of m that 
|m(t+h)—mi(t)| < lim sup|W s+1(t-+h) —W y41(¢)|. 


Combining this with (17) and then letting « — 0, we obtain 


(18) lm(t +h) — m(t)| < f ¥i(s, m(s))ds, 


using the continuity and monotonicity of ¥,:(¢,r) in r. The inequality (18) 
implies that m’(t) exists on any interval (t,t + A), fort > 0, and that |m’(t)| < 
¥i(t, m(t)). The argument used in the proof of Theorem 1, beginning with (7), 
proves 


(19) m(t) < A(t). 
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To complete the proof, we must show that (19) implies m(t) =0. The 
argument is much the same as the last stage of the proof of Theorem 1. Suppose 
m(r) > 0 for some 7,0 < r < a. A repetition of the first part of the proof, 
using Wz in place of yj, gives a solution v(t) of (5) on 0 < t < + such that 
v(r) = m(r), 0 < v(t) < mit), v0(0) = 0. Then 0 < lim, 0(t)/B(t) < limyzso 
m(t)/B(t) < lim, A(t)/B(t) = 0, using (3) and (19). By hypothesis, this 
implies that v(¢) vanishes identically, contradicting v(r) = m(r) > 0, and it 
follows that m(t) = 0,0 <t <a. 


4. The results of this paper were originally suggested by consideration of a 
function f(t, x) satisfying the pair of conditions 


(20) \f(t, x1) — f(t, x2)| < Clix: — x,/*, 
(21) lf (t, x1) _— f(t, X:)| < Rix, — x,|/t, 


where C, a, and k are constants with 0 < k,0 <a <1,k(1 — a) < 1. It was 
shown by Krein and Krasnosel’skii (6) that this pair of conditions implies the 
uniqueness of solutions of (1), and by Luxemburg (7) that the successive 
approximations (14) converge. In (1) these results were proved by methods 
analogous to those used here. They are special cases of Theorems 1 and 2 
obtained by taking y(t, r) = Cr*, W2(t, r) = kr/t. It is easily verified that the 
hypotheses of Theorems 1 and 2 are satisfied, with A(t) = C(1 —a)t'/C-@, 
B(t) = @, and the condition k(1 — a) <1 gives (3). Thus the results of 
Krein—Krasnosel’skilf and Luxemburg are contained in those of this paper. 

The general uniqueness theorem of Kamke (5) and the successive approxi- 
mation theorem of Coddington and Levinson (4) are also contained in this 
paper. In fact, the proofs used here are quite similar to the proofs of these 
results (3, chapter 2). The central hypothesis there is the existence of a func- 
tion y(t, r), continuous and non-negative in 0 < ¢ < a, r > 0, and monotone 
non-decreasing in r for each fixed ¢, such that there is no non-trivial solution 
of 


(22) Yr =V(t,r), 1(0) =r'(0) =0, 
on 0 < ¢ < a, and such that 
f(t, x1) — f(t, x2)| < w(t, [xi — x2!), 


for (t, X:), (t, X2) in a region 0 < ¢ <a, |x| < b. That this result is contained 
in the present results is seen by taking ¥:(t,7) = ¥(t,r) and yo(t,r) = 2M, 
where M is a bound for /f(t,x)| in 0 <t <a, |x| < 6. Then we can take 
B(t) = t. If A(é) is a solution of (4) with A(0) = A’(0) = 0, the condition 
(3) follows from 


0 = A’(0) = lim A (¢) /t = lim A(t)/B(t), 


and the hypotheses of Theorems 1 and 2 are satisfied. 
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5. Now we obtain a generalization of the Krein—Krasnosel’skii-Luxemburg 
results by application of our theorems. This will illustrate the flexibility of 
the method in generating new results. We retain the condition (21), corre- 
sponding to ¥2(t,r) = kr/t, B(t) = #, but replace (20) by 


(23) f(t, X;) ome f(t, X:)| < C\x; anni x,|*#, a< 1, B > —1, 
corresponding to the,choice y(t, r) = Cr*#. It is easy to solve (4) with this 
choice of ¥, and we find that we can take A(t) = pt®+/C-®, where p is a 


constant depending only on a, 8, C. For (3) to be satisfied, we must have 
(8 + 1)/(1 — a) —k>O,or68+1> Rk(1 — a). The results of (1; 6; 7) are 
the special case 8 = 0. If (8 + 1)/(1 — a) < 1, the condition (23) alone 
would suffice, as can be seen from the Kamke-Coddington-Levinson results. 
Thus we assume 8 + 1 > 1 — a,or8 > — a. 

Slightly more generally, we can replace the condition (23) by 


(24) \f(t, x1) — f(t, x2)| < Clix, — x,/*A(d), 


where a < 1 and A(t) = o[f*"-”-"] as t+ 0. This corresponds to the choice 
¥ilt, r) = Cr*A(t), which yields 


LAM = pf awa 


If A(t) satisfies the above growth condition, we have 
t 
f A(t)dt = of**-}, 
0 


and A(t) = o(t*) = o(B(t)), so that the hypotheses of our theorems are 
satisfied. 

These examples illustrate suitable choices of ¥(t,r) and A(t). There is 
less flexibility in the choice of ¥2(t, r), as the definition of B(#) implies that 
there can be no non-trivial solution of (5) which vanishes on an interval near 
the origin. 


6. The results obtained in this paper can also be given in the more general 
setting of (2). As the proofs are quite similar to those given here, with the 
same alterations used in (2), we shall only outline the results, without proofs. 
Instead of using a norm |x| for vectors x, we use a function V(t, x) defined for 
real ¢ and vectors x with non-negative real values, which is continuous in 
(t, x), has one-sided partial derivatives with respect to ¢ and the components 
of x, and whose vanishing implies x = 0. We use V, to denote a partial deriva- 
tive of V with respect to ¢, V, to denote some gradient vector of V, and . to 
denote the usual scalar product of vectors. Any condition which involves 
V, or V, is understood to be required for all one-sided derivatives. 

The only change in the hypotheses of the theorems is that the pair of con- 
ditions (2) are replaced by 


V(t, ie Xo) + Vz ° (f (2, X:) = f(t, X2)) < vill, Vit, ees X2)), 1 = ER 2, 
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B in the generalization of Theorem 1. For the generalization of Theorem 2, we 
must add the condition 


Ot, fF UE(s, x1(5)) — £65, ms)) Ids) 


< J {Vils, 216s) — x0(5)) + Ve ECs, 21(6)) — £66, 20(8))1} ds 


for any continuous f, x;, X2, and the condition V(max (t;, t:), X1 + X2) < 
V(x, t1) + V(X, te). This condition is essentially just a weakened form of 
the triangle inequality for the function V(t,x). The other hypotheses of 
x Theorems 1 and 2 and the conclusions remain unchanged. 


ww ee 
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THE ASYMPTOTIC BEHAVIOUR OF THE 
LAURENT COEFFICIENTS 


MAX WYMAN 


Introduction. Let G(z) be a function of a complex variable, regular in 
the annulus 0 < a < |z| <b < @~. We shall assume there exists a curve 
within the annulus for which 


lim G(z) = @, 

lzl+d 
provided z is restricted to be a point of this curve. Under these restrictions 
G(z) has a Laurent expansion of the form 


(1.1) G(z) = > a,2", 
where the Laurent coefficients a, have the integral representation 
(1.2) a, = Qniy* { Ge) or" de, 

c 


and C can be any contour, within the domain of regularity, that encloses 
z = 0. We shall also assume that the a, are all real numbers. Using the usual 
complex conjugate notation, we can, therefore, write 


(1.3) G(z) = G(2). 


The problem of determining the asymptotic behaviour of a, as n — © is 
very old in mathematical literature and appears in many forms and disguises. 
It has been solved for specific classes of functions by many people using a 
multitude of methods. For certain lacunary type series a, can have an almost 
chaotic behaviour for large n. It is, therefore, too much to expect that there 
exists a single method that will give the asymptotic behaviour of a, for the 
class of all functions that possess Laurent expansions. We shall, therefore, in 
this paper, consider only a single method that yields the asymptotic behaviour 
of a, for a particular class of generating functions G(z) which will be called 
admissible. The major goal is to make the class of admissible functions as 
large as possible. 

As far as we are aware the first attack on problems of this type, that could 
claim any degree of generality, is due to Darboux (1). His class of admissible 
functions possessed Maclaurin expansions with a finite radius of convergence. 
On the circle of convergence G(z) was allowed to have only a finite number of 
singularities of a particular type. The success of Darboux’s method is almost 
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entirely due to the fact that his assumptions allow us to deform the contour 
C beyond the circle of convergence and ultimately to prove that only small 
contours surrounding the singularities of G(z) contribute to the asymptotic 
behaviour of a,. This procedure is almost characteristic of all methods that 
depend for their success on the classification of the number and types of 
singularities the generating function G(z) is allowed to have on the circle of 
convergence. 

Such deformations are not available if the circle of convergence is a natural 
boundary of G(z). In particular the partition function 


(1.4) G(s) = I a—-2)"-= > p(n) 2" 


falls into this classification. There exists, in the literature, several derivations 
of the behaviour of p(m) asn — @. 

The success of known methods of studying the asymptotic behaviour of 
sequences, defined by integrals, lies almost entirely in the fact that the 
behaviour is determined by a knowledge of G(z) in the neighbourhoods of 
certain points called critical points. Erdélyi (2) has pointed out that there 
does not exist any general theory of critical points and that only a few types 
of integrals have been studied. We shall provide such a theory for integrals 
of the type (1.2) and for which G(z) belongs to our class of admissible func- 
tions. 

Among recent papers dealing with problems of this type we note those of 
Szekeres (8), Hayman (4), and Moser and Wyman (5; 6). Szekeres finds a 
complete asymptotic expansion for a specific example. Hayman introduced 
the idea of a class of admissible functions and found the first term of an 
asymptotic expansion for the Maclaurin coefficient of all members of his 
class of functions. Wyman and Moser find complete asymptotic expansions 
for the Maclaurin coefficients of a class of integral functions. We note these 
papers in particular because in each case there is a more or less common 
pattern of attack. In our present .paper we shall attempt to generalize the 
general pattern of attack of the papers mentioned above. 


2. Asymptotic expansions. In applying a method of critical points to 
integrals of the form (1.2) we place suitable restrictions on G(z) that enable 
us to determine the asymptotic behaviour of a, by considering only restricted 
portions of the contour C. It is thus clear that such a method implies a criterion 
by means of which we may recognize the portions of C that are to be retained 
and to recognize also the portions of C that may be discarded. This criterion 
is found in the choice of a suitable definition of an asymptotic expansion. 

Since Poincaré’s introduction of a definition of an asymptotic expansion, 
the concept has been generalized in many ways. Erdélyi (2) has given an 
elegant treatment of the concept which can be used to develop a general 
theory of asymptotic expansions. However, his treatment imposes certain 
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restrictions that are neither necessary nor desirable for the type of problem 
we shall consider in this paper. In a private conversation Professor Erdélyi 
has given me a more general definition that completely fills our present needs. 
We shall, however, use only a very specialized version of Erdélyi’s later 
definition. 

Throughout the paper 7 shall denote a real variable lying in the fixed 
interval J given by 0 << a<r<6b< @. We shall be interested in the 
behaviour of certain real or complex valued functions F(r) as r—> 6b. The 
end-point a plays no essential role in our discussion except in so far as all 
statements are required to have meaning only if r is restricted to be a value 
in J. 


Définition 2.1. A real valued positive function V(r) is called a comparison 
function if 
lim V(r) = @. 
r+d 
Définition 2.2. Two real or complex valued functions f(r) and g(r) are said 
to be asymptotically equal at r = b, with respect to the comparison function 
Vir), if 
(2.1) lim v* (f(r) — g(r)) = 0 


for every non-negative integer n. 
We shall write this relationship as 


(2.2) f(r) = g(r) 
or in terms of the order relation o as 
(2.3) f(r) — g(r) = o( V™), 


for every non-negative integer n. 

Since we shall usually require 6 and V(r) to remain fixed in our discussion 
we shall often delete the qualifying phrases in the definition of asymptotic 
equality. 

If there exists a value ro in J and positive real numbers a, 8 such that 


(2.4) f(r) — g(r)| = O(exp(— 6 V*)) 


for all r > ro, then f(r) = g(r). 

Asymptotic equality is an equivalence relation and divides the class of 
complex functions defined on J into equivalence classes. In our development 
we shall usually replace a function f(r) = 0 by zero. If b = ~, V(r) = logr 
then r, (r? + 1)/r are examples of a pair of asymptotically equal functions. 


Définition 2.3. Let A,;(r), k = 0,1,2,... be an infinite sequence of com- 
plex valued functions. The formal series 


co 
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is called an asymptotic expansion of a function F(r) if, for every non-negative 
integer m, 


(2.5) lim y” (- = > A, v*) = 0. 
We write 

(2.6) Fw } A, V>* 

or 

(2.7) F- > A, V* = o(V-”). 


The asymptotic equality of two functions is a special case of an asymptotic 
expansion with A, = 0, k > 1. It is also easy to show that the definition of 
an asymptotic expansion implies A, = o(V), k > 1 and that the asymptotic 
expansion of a function is not unique. Any of the A,(r) may be replaced by 
an asymptotically equal function. 


Définition 2.4. If F(r) and H(r) are two complex valued functions such 
that 


« 


(2.8) F/H~ >> A, V™* 
k=0 

then we write 

(2.9) F~H > A, V~. 
k=0 


At first glance it might be thought that Definition (2.4) is redundant in 
the light of the fact that the A,(r) of Definition (2.3) are allowed to be 
functions of r. This, however, is not so because (2.9) implies only that A, = o( V) 
while 


r 


(2.10) Fr 2D (HA) v* 

*. k= 
implies HA, = o(V). Hence (2.10) may or may not place a restriction on H 
that is not implied by (2.9). 

The major distinction between our use at present of the meaning of an 
asymptotic expansion and Erdélyi’s published version is the fact that we do 
not require A,V~* to be an asymptotic sequence, even though V~ is such a 
sequence. This situation could exist, for example, if some of the A, = 0. 


3. Relevant paths. Returning to (1.1), we may, without any loss of 
generality, assume that we are interested in the asymptotic behaviour of a, 
for large positive m. The behaviour of a,, for large negative m, could then be 
obtained from the generating function G(1/z). If G(z) has the form 


(3.1) G(z) = P,(z) + Q(z) 
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where P,,(z) is a polynomial of degree m and Q is a regular function of its 
argument, then our problem is trivial. For such a function a, = 0,2 > m + 1. 
Hence we shall exclude all functions of the form (3.1) from the remainder 
of our discussion. 

Using the polar form z = r exp(i#) we define M(r, 0) by 
(3.2) M(r, 0) = |G(r exp(z 6))|. 


Since we shall always operate within the domain of regularity of G(z) for 
which (1.1) is true, r shall always lie in the interval J of the previous section. 
The function M(r, 6) is a periodic function of @ with a period of 2x and has 
continuous partial derivatives of all orders except possibly at the zeros of 
G(z). The function M(r) defined by 
(3.3) M(r) = sup M(r, @) 
is the so-called maximum modulus function and it is known that there always 
exists a point z = z(r), on |z| = r, such that |G(z(r))| = M(r). By varying 
r, 2 = 2(r) can be considered as the parametric representation of a curve in 
the complex z plane which is called a path of maximum modulus. The path 
of maximum modulus need not be unique. From the assumptions made, in 
the Introduction, on G(z) it is clear that 
lim M(r) = @ 
Td 

and hence log M(r) is a comparison function. The choice of log M(r) as a 
comparison function may, however, introduce a complexity that is neither 
necessary nor desirable. Hardy (3) has shown that G(z) = exp(sin z) exp(exp 2”) 
has M(r) = exp|sinr | exp(expr*). The corresponding path of maximum 
modulusis@ = 0,2kr <r < (2k+1)randé@ = wz, (2kR+1)" <r < (2k + 2)z. 
Hence the path of maximum modulus has an infinite number of discontinuities. 
For this reason, we shall see, the a priori choice of log M(r) as a comparison 
function often introduces a discontinuous picture of our problem that can 
easily be avoided. Previous authors have featured log M(r) in their dis- 
cussions and have required @ = 0 to be the continuous path of maximum 
modulus. Such a starting point greatly reduces, for no valid reason, the extent 
of our class of admissible generating functions G(z). 

Let us return to (3.2) and consider the stationary values of M(r, @) which 
are given as solutions of the equation 

aM 
(3.4) i 0. 
There always exists at least one real solution of (3.4) 6 = @(r). In fact our 
assumption that all a, are real implies that @ = 0 or @ = =z are always solutions 
of (3.4) except possibly for the case when M(r, 0) = Oor M(r, x) = 0. Implicit 
function theory tells us that if (ro, 0) is a solution of (3.4), for which 
a°M 
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then there always exist neighbourhoods |r — ro| < 5, |@ — | < 8 for which 
the solution @ = @(r) is unique and has a continuous derivative. We may, 
therefore, consider such a solution @ = @(r) as the polar equation of a path, 
in the complex z plane, which lies within the domain of regularity of G(z). If 
we traverse such a path, with increasing 7, the path remains unique until 
we strike a point at which 

a°M 0 

5 iting 
At such a point the path may cease to be unique and several branches may 
appear. However if r is sufficiently close to 6, then (9, pp. 22-7), the number 
of such branch paths remains finite. Although @(r) need not be differentiable 
at a branch point, the continuity of @(r) is preserved. All paths determined 
as solutions of (3.4) shall be called the stationary paths of G(z). 

We interrupt our general development to consider the specific function 

exp (z/1 + s”)) which gives considerable insight into the general situation. 
The solutions of (3.4) are 


(3.5) 6=0,0=27,0<Er<ev2-1 


(3.6) 6=0, 6= 2, 6 = + arccos((1 — r*)/2r), 6 = * + arccos 
((1 — r*)/2r), /2-1l<r<l. 


The points, (0, /2 — 1), (4, /2 — 1), are branch points and there exist six 
stationary paths by means of which we may leave z = 0 and arrive at the 
boundary |z| = 1. 

We note that the number of intersections of the stationary paths with the 
circle |z| = r is a function of r and indicates the possibility that, for other 
generating functions G(z), that the number of such intersections may tend 
to @ asr — bd. Let us now consider the stationary path @ = 0,0 <r < 1. For 
a fixed value of r and variable 6, M(r,0) is a maximum of M(r, @), for 
0<r<+J/l1—2, and then M(r,0) changes its character to become a 
minimum of M(r, 6) for V2 — 1 <r < 1. Again the example indicates the 
possibility that there may exist stationary paths @ = @(r) such that, along 
such a path, M(r, @(r)) may change its character from being a maximum of 
M(r, 6) to being a minimum of M(r, @) an infinite number of times. 

Returning to the general case we will denote by L,, (k = 1,2,..., 7), the 
stationary paths contained in the annulus rp < |z| < r < 6. We note that the 
integer J may be a function of r. The corresponding polar equations are 
written as 6 = 6,(r) and M,(r) is defined by 


(3.7) M,(r) = M(r, &(r)). 


Assumption (1). For r sufficiently close to } there exists a continuous 
stationary path, with polar equation @ = 6;(r), by means of which we can 
reach the boundary r = b. Further M(r, 6)/M, is bounded uniformly in @. 

Since the maximum modulus M(r) is always attained for every value of r 
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at some value of @ assumption (1) implies that M(r)/M, is bounded as r — 5. 
However we have previously assumed that 

lim M(r) = @. 

r+»d 
This means that 

lim M,(r) = o. 

rd 
Thus log M;(r) is a comparison function by means of which we may give 
meaning to the concepts of asymptotic equality and asymptotic expansions. 
Throughout the remainder of the paper log M,(r) will be our comparison 
function and (log M,(r))-” will be our asymptotic sequence. 


Définition. A stationary path for which M,/M, = 0 will be called an 
irrelevant path. All other stationary paths will be called relevant paths. 


Assumption (2). We shall assume that the relevant paths can be identified 


in such a way that the following properties are true for r sufficiently close 
to b. 


(a) The relevant paths are all continuous curves by means of which we 
may reach the boundary r = b. 
(b) If @ = @,(r) is a relevant path then a constant K, > 0 and a non- 
negative integer m exists such that M,/M, > K;, (log M,)~. 
(c) The number N of relevant paths is fixed and independent of r. 
(d) For every relevant path @ = 6,(r) the 
lim 6,(r) 
r+d 
exists. Further if 6 = 6,(r) and @ = @,(r) are distinct relevant paths then 
lim 6,(r) ¥ lim 6,(r). 
r+d r+d 
(e) Along every relevant path the function M/(r, @) has the property 
a°M 
67 < 0. 


An effect of assumption (2) is that for r close to 6 the relevant paths can 
have no point of intersection. Such a point would be a branch point at which 





This would of course contradict the assumption. The assumption also guarantees 
that there can be no point of intersection of relevant paths even at r = b. We 
may also conclude that the number of intersections of the relevant paths with 
the circle |z| = r < 5 is equal to the fixed number NV. Hence without any loss 
of generality we may assume that the relevant paths are L,, (k = 1,2,..., N) 
and that they have been numbered in a counter-clockwise direction beginning 
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with the featured path L;. Along every relevant path the value M;,(r) is, for 
fixed r and variable @, a maximum value of M(r, 6) as r — bd. Finally, we also 
note that the functions 6,(r) corresponding to relevant paths all have con- 
tinuous first derivatives. 

Since (any constant)/M,(r) = 0 it is clear that M,(r), (k = 1,2,..., N) 
must all be unbounded functions of r as r — b. Since M(r)/M,(r) and M,(r)/ 
M(r) are both bounded functions we must have 
(3.8) lim log M(r)/log M,(r) = 1. 


r+d 


This result can be extended to 


(3.9) lim (log M;(r) /log Mi(r)) = 1, k= 1,2,...,N. 
r+d 


It is not difficult to show that the function exp(sin z)exp(exp 2?) obeys 
assumptions (1) and (2). 


4. Functional behaviour along relevant paths. In the present section 
we shall investigate the functional behaviour of log G(z) and its derivatives 
along relevant paths. Since it turns out to be simpler to study the derivatives 
with respect to log z we introduce the operator H by 


d 
(4.1) H= P 
and attach the usual meaning to the iterated operator H”. Further, the 


symbol H"F(w) shall always mean H”F(z)],... In this notation (3.4) becomes 
(4.2) Im (H log G(z)) = 0 


and hence the quantity H log G(z) is always real along every relevant path Ly. 
Let us denote the points of intersection of ZL, with |z| = r < 6 by 
(4.3) zs=2,(r);. |x| = 7, k= 1,2,...,N. 


For variable r, (4.3) provides a parametric representation of the relevant 
paths. Further z,(r) has a continuous derivative with respect to r as r — 6, 
since 6,(r) has this property. Since |G(z,)| = M;,(r) and M;,(r) is a maximum 
of M(r, 0) we must have G(z,) # 0. Thus all of the functions H™ log G(z,(r)) 
have a continuous derivative with respect to r for r sufficiently close to bd. 
The condition 


a°M 
er <0 
can be translated to read 
(4.4) RI (H? log G(z)) > 0, 


along every relevant path. 
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LemMA 4.1. The function H log G(z) is increasing in r and has no upper 
bound along any relevant path. 


Proof. Let 2) be any fixed point along a relevant path and suppose z is a 
variable point along the same path. Along such a path H log G(z) has been 
shown to be real. If there exists a constant K such that H log G(z) < K then 
the identity 


(4.5) log G(z) = log G(zs) + fu log G(z) d(log z), 
where the path of integration is the relevant path, implies 


(4.6) IG(z)| < |G(zo)| (r/|zo|)*. 


However, since our path of integration is a relevant path L,; we must have 
|G(z)| = M,(r). If 6 is finite, (4.6) implies that M,(r) is a bounded function 
of r as r — b. This contradicts our assumption that L, is a relevant path. If 6 
is infinite the proof is somewhat more difficult but follows directly from 
known complex function theory as long as we remember that functions of 
the form (3.1) are excluded. 

Along any relevant path we know that H log G(z) has a continuous first 
derivative with respect to r, if r is close enough to 6. An easy computation 
gives 


2 
(4.7) r+ Blog Gz) = RI H* log G(s) (1 +r’ (#) ) , 
dr dr 
From (4.4) we have d/dr H log G(z) > 0 along a relevant path and hence 
H log G(z) must be an increasing function of r along every such path. Since 
H log G(z) is an increasing continuous function of r, which does not have an 
upper bound, along every relevant path we must have 
(4.8) lim H log G(z) = + ©. 
rd 

Thus far in our paper we have been discussing the functional behaviour 
of G(z) and have not related this behaviour to the Laurent coefficients a,. 
In order to make a start on obtaining this relationship we shall for the moment 
allow n to be a large positive real number. Ultimately n will assume its integer 
meaning in a,. The results contained in Lemma 4.1 imply that, along any 
relevant path Z,, the equation 


(4.9) H log G(z) =n 


always has a unique solution z = Z,(m), for m sufficiently large. Considering 
nm as a variable, the equations 


(4.10) z= Z,(n) 


provide a second parametric representation of L,. Since it becomes con- 
venient to use both of the parametric representations (4.3) and (4.10), simul- 
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taneously in some discussions, we introduce the following convention to avoid 
confusion. 


Convention. If r is taken to be an independent variable then n shall mean 
the function » = n(r) defined by 


(4.11) n(r) = H log G(z2,(r)). 


Thus 2;(r) = Z;(m) and we may consider both sets of points z,(r), Z,(m) as 
completely determined. 

If m is taken to be an independent variable then r is taken to mean the 
function r(m) defined by 


(4.12) r(n) = Z,(n), 


and again 2;(r) = Z,(m). As before %(r), Z,(m) are completely determined 
by this convention. This notation implies 
(4.13) lim n(r) = @, limr(n) = b. 
r+d Naw 
Assumption (3). We shall assume that, for r sufficiently close to 5b each 
of the following are true. 
(a) There exist positive constants p, P:, P2, such that 


(4.14) P, (log My) < n(r) < P2 ((log M,)'**) 


(b) Z,(m)/z(r) = 1 + O(1/n). 

(c) In the complex w plane there exists a fixed neighbourhood |w| < A for 
which the functions log G(Z,(m) exp(log Myw/n)) are all regular functions 
of w. Further, 


lim [log G(Z,(m) exp(log Mi(r) w/n)) — log G(Z,(m))]/log M, 


exists, uniformly in w, for all w within and on the boundary of |w| < hk. We 
shall denote this limit by g,(w) and shall assume Rig,’’(0) ¥ 0. 


Definition 4.1. Any generating function having a Laurent expansion of the 
type (1.1) and for which assumptions (1), (2), and (3) are true will be called 
an admissible function. 


The major result of this paper will be that the asymptotic behaviour of 
the Laurent coefficients of every admissible function can be determined. At 
this stage the conditions contained in assumption (3) must seem somewhat 
like pulling rabbits out of a hat. However these conditions do arise quite 
naturally in the method that we shall use to prove our central theorem. 

Part (a) of assumption (3) makes all members of the original class of 
functions considered by Darboux inadmissible. As such our central theorem 
will be complimentary to the Darboux result. The second part of the assump- 
tion turns out to be sufficient to control the behaviour of the factor z~“*+” 
in the integrand of (1.2). I suspect that this part of assumption (3) may be 
completely unnecessary and that every admissible function may have this 
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property. The third part of the assumption involves functions that are 
reminiscent of the indicator functions of integral function theory. In some 
cases the indicator function can be derived from g,(w). Obviously the third 
part of this assumption places some restrictions on the locations of the 
zeros of G(z). 

To be a member of Hayman’s class of admissible functions a generating 
function G(z) has to have a unique path of maximum modulus that is 6 = 0. 
Certainly assumption (1) is considerably weaker than this condition. To this 
extent our class of admissible functions is considerably more extensive than 
is Hayman’s. It is not, however, possible to say that our class of such functions 
completely contains Hayman’s. Our assumptions have been designed to measure 
the contribution of any portion of the contour C, of (1.2), to any term in a 
complete asymptotic expansion. Hayman’s assumptions were designed to 
measure the contributions of C to the first term of such an expansion. If we 
were to restrict ourselves to Hayman’s objective we could, along the lines of 
the present paper, greatly extend Hayman’s class of admissible functions. 

When the outer boundary of convergence is finite our assumptions say, in 
essence, that G(z) may have an infinite number of singularities on r = b. 
However, there can be only a finite number that dominate all the rest. The 
meaning of dominate is of course derived from our definition of asymptotic 
equality. 

The two specific functions used for illustration in this paper are both 
admissible functions in our sense and neither is admissible in the Hayman 
sense. To illustrate another point we shall prove that 


(4.15) G(z) = [exp(1/2(1 — 2?))]/(1 — 2) 
is an admissible function. For (4.15) it is not too difficult to prove that 
(4.16) z=r and zs=-r 


are the only two possible relevant paths. Since z = r is the path of maximum 
modulus and since this path is continuous, for this example we choose 














(4.17) ai(r) = 7, 2:(r) => — 7. 
Hence 
_ exp(¥(1—1°*)) , _ exp(}(1 — 7°) 
(4.18) M,(r) = oy: , M,(r) = re 
Since 
. (log Mi(r))M2(r) _ 1 

(4.19) lim Mir) = 3: 
we see that both paths are relevant. The fact that 

a°M 

- 


on a relevant path is easily verified. This means assumption (1) is satisfied. 
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Turning to equation (4.9) the solutions are 


1 3 1 

(4.20) Zi(n) = 1—- Fy gat aaypateee 

1 1 1 
(4.21) Z:,(n) = —1 +5) “i Si) 7 Pina 
Since Z,(m) = 2,(r) = r we find 

1 3 1 

(4.22) alr) = —~l+aRmat+se aay 4t::: 
and 
(4.23) Z2(m)/22(r) = 1 + O(1/n). 
Since log M,(r) = 3(n)' + ..., the first and second parts of assumption (3) 


are true. One can readily verify that g:(w) = g.(w) = w/1 — w and that the 
remaining conditions of assumption (3) are true. Thus (4.15) gives a third 
example of an admissible function. It is interesting to note that (4.15) is 
not a member of Hayman’s class of admissible functions. Actually (4.15) 
violates one of Hayman’s assumptions that is not vital to the success of the 
method. This example provides an illustration of how Hayman’s class of 
admissible functions could be extended along the lines of our present paper. 
The second relevant path contributes only to the second term of the asymp- 
totic expansion. 





5. Further preliminary results. From assumption 3(c) it is easily seen 
that log G(Z, exp u) is a regular function of the complex variable u pro- 
viding |u| < hk (log M,)/n. In this neighbourhood we may expand log G 
(Z, exp u) into a Maclaurin expansion and obtain 


2 
(5.1) log G(Z, exp u) = log G(Z,) + H log G(Z,)u + H'log G(Z,) = 
+ > H™ log G(Z,) =. 
m=3 m: 
Since H log G(Z,) = m we may, by choosing u = log (2/Z,), say 
2 
(5.2) log G(s) = log G(Z,) + m log(2/Z,) + H’log G(Zx) . ro 
+ D0 H" log G(Z,) (log(s/Z.))"/m!, 


providing |log (z/Z,)| < h (log M,)/n. If we denote the set of points satisfying 
llog (2/Z,)| < h log( M,)/n by D,, then the point z = 2,(r) lies in D,. 
Assumption (3) tells us that 


(5.3) lim (log G(Z, exp(log M,(r) w/n)) — log G(Z,)) /log Mi(r) = gx(w), 


and that the limit exists uniformly in w, for |w| < 4. We may, therefore, 
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differentiate (5.3) m times with respect to w and then place w = 0. This 
gives 


(5.4) fim ((log M,(r))"~* H™log G(Z,)/n™) = gi” (0). 
From (5.4) the following results can easily be obtained. 
(5.5) g(0)=0, = gr’(0) = 1. 
(5.6) R1 g,’"(0) > 0. 
There exist constants P; and P, such that 
(5.7) P; (log Mi(r)) < |H*log G(Z,)| < Ps ((log Mi(r))'**). 
(5.8) lim |H’log G(Z,)| = @. 
(5.9) im arg(H*log G(Z,)) = arg gx”’(0). 
(5.10) H'log G(Z,) = 0(H*log G(Z;)). 
(5.11) |H™ log G(Z,)/m!| = 0(n™/h™ (log M,)"""). 
If we define y by 
(5.12) ve = lim arg (H"log G(Z:)) 
then (5.6) and (5.9) tell us that 
(5.13) lvx| < $x. 
We shall define an infinitesimal « by 
(5.14) e = |H? log G(Z,)|* 


where, for reasons that shall appear later, we restrict a to satisfy 
(5.15) (6p + 2)/6(2p + 1) < a@ < 3. 


The constant p is taken from assumption (3). Since the R/ (H? log G(Z,))>0 
we may introduce (H* log G(Z,))! in an unambiguous manner by taking the 
branch with a positive real part. The quantity ¢0,(r) defined by 


(5.16) ¢o2(r) = |H? log G(Z;)|-* (H* log G(Z,))* 
has the properties that 


” ; 
(5.17) — | doe(r)| ~ may (n® /log M;)** = O[(log M)°*”?*} 


and 


(5.18) lim |¢o,.(r)| = @, 
r+d 


and 
(5.19) lim larg do.2(7)| = 3|ve| < 3. 
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Since 


lim ne = @, 


Td 
we see that 1/n is a higher order infinitesimal than «. Further, the fact that 


lim ne/log M,(r) = 0 
rd 


implies that any point z lying on the line joining 2 exp ie to Z, exp ie is a 
member of the domain D,. For such points z we may prove 


(5.20) log (2/Z,) = ie + O(1/n). 


Further, since 1/m is a higher order infinitesimal than « we may, to this 
order of approximation, write 


(5.21) log (2/Z,) = te. 


This fact plus (5.2) enables us to prove that 


2 
(5.22) log(G(z)/2") = log(G(Zx)/Zt) — 5 H’log G(Z,) + 0(n'e*/ (log Mi)’). 


From (5.4) 
(5.23) lim [(log M1) H’log G(Z,) /n*] = g,’"(0) 
r+d 
and from (5.6) g,’"(0) # 0. Hence 
(5.24) H? log G(Z,x) = n*(g,’(0) + 0(1))/log M,. 
Thus (5.22) may be written 


2 
(5.25) log (G(z) Zz/2"G(Z,)) = 5 H’log G(Z,)(1 + O(ne/log M;)). 


We have already seen that 
lim (ne/log M,) = 0. 


td 


Hence (5.25) can be shown to imply that a constant P; > 0 exists such that 
(5.26) |G(z) Zt/G(Z;,) 2"| = O(exp(—Ps(log M1)"™)). 


Hence |G(z) Z,"/G(Z,)z"| = 0 with respect to log M,(r). Since G(Z,)/M,(r) 
and (r/Z,)" are bounded functions of r this means 


(5.27) |G(z) r*/M,(r)2"| = 0, with respect to log M,(r). 


In particular (5.27) is true for the points z, exp (ie) and Z,; exp (ie). Obviously 
the same result may be obtained by replacing « by — «. We use (5.27) to 
prove the following lemma. 


Lemma 5.1. The quantity I(r) defined by 
(5.28) I(r) = (7° /Mi(r)) f G(z) ot” dz 
c 
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ts = 0, with respect to log M,(r), providing L is taken to be one of the following 
paths of integration. 
(a) The arc of the circle |z| = r joining the points z, exp (te) to 2,41 exp(—ie) 
(b) The line joining z, exp (ie) to Z, exp (ie). 
(c) The line joining z, exp (— ie) to Z, exp (— ie). 


Proof. Our notation implies that z,(r), 2.41(r) are two points of intersection 
of two consecutive relevant paths with z = r. By assumption (2) 


lim arg z(r) 

r-»d 
exist for k = 1,2,..., N and, further, the values of these limits are distinct. 
Hence our numbering system can be assumed, without any loss of generality, 
to imply that 


lim arg a (r) < lim arg 2,41. 
+d 


r+d r 


Further, since e was defined by (5.14) we have 


lim arg(z,(r) exp(ze)) = lim arg z,(r). 
r+d 


r+d 


This of course means that for 7 close to } 
arg %(r) < arg (z(r) exp (ie)) < arg (ze4:(r) exp (— te)) < arg 2g4:(r). 


This means that there can be no points of intersection of a relevant path and 
lz] = r contained in arg (z(r) exp (ie) < @ < arg (Ze4:(r) exp (— ie). Thus 
on the path of integration given by part (a) we must have |G(z)| less than 
or equal to the value of |G(z)| at a maximum which is on an irrelevant path, 
or |G(z)| is less than or equal to the value of |G(z)| at one of the end-points. In 
either case I(r) = 0. To make our proof consistent even if the points zy and 
z, are involved we adopt the convention that |zy4:| = |z:| but arg zy41 = arg 
21 + 2n. 

For the straight line paths of integration (5.27) is sufficient to establish 
the result. As one would suspect, Lemma 5.1 enables us to recognize the 
portions of the contour C of (1.2) that can be discarded. 

To complete our preliminary results we now proceed to study the behaviour 
of certain functions which enter into the derivation of the central theorem 
of the next section of our paper. Let ¢ be a complex variable and suppose 
we restrict @ to be on the line, in the complex ¢ plane, joining the points 
dox(r) to — dox(r) where $0, is given by (5.16). We define a,, ,, (m = 3, 4,..., 
R=1,2,...,N), by 


(5.29) dmx = (log Mi(r))*"-H™ log G(Z,) (i @)"/(m!($H? log G(Z,))"/?) 
then (5.11) and (5.24) combined with (5.29) give 


(5.30) lam x| = 0((Q\¢|)™), 
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where Q is a positive constant. For this reason the functions f,(r, ¢, u), of 
the complex variable u, defined by 


(5.31) Fults 4) = De ms.ntl” 
are regular functions of u as long as Q|¢| |u| <1. From the fact that 
lo! < lool, (5.15) implies that if |u| < 2 (log Mi(r))~ then 
lim |¢u| = 0. 
r+d 
Thus the point u = (log M,(r))+ is certainly within the domain of regu- 


larity of f,(r, ¢, wu). Let us now go on to consider the functions F,(r, ¢, «) 
defined by 


(5.32) F, (r, ?, u) = exp fi (r, >, u). 


All of the functions F,(r, ¢, «) have Maclaurin expansions about u = 0 and 
the radii of convergence of these expansions are certainly all > 2 (log M,(r))-4 
We write 


(5.33) F,(r, ¢,u) = 1+ 4 bm,x(r, &) 0". 

We can easily establish that 

(5.34) bom+1,e is a polynomial in @ and is an odd function of ¢. 
(5.35) bomx is a polynomial in ¢ and is an even function of ¢. 


(5.36) lbm.x| = O((Peld|)*™), for large |¢|, and Ps. and the order relation 
involve constants that are independent of r. 


The result (5.36) follows from a lemma of Moser and Wyman (7). Further, 
(5.15) implies that 


lim |o|* (log M,(r))* = 0. 
rb 


Hence as long as |u| < 2 (log M,(r)-3 we must have 








nad > bm.x(r, 6) u™!| = O((ps|o|)°*** (log Mi(r)) 4°"). 


From the fact that 
exp 18 . 
f ,xP(—¢ )) odo 


exp { 


exists as long as |8| < 2/4 it is easily shown that 


= O((log Mi(r))**”) 





(5.38) ro" ( * bmx (7, ) i”) exp(—¢’) d¢ 


s+ 


where in (5.38) the constants entering into the order relation may depend 
on s but not on r. In particular if R,, is given by 
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(5.39) Ry = f . > (bm.x(r, ) (log Mi(r))~™) exp(— 9”) de 
—0.k s+1 

then 

(5.40) IRs.x| = O( (log Mi(r))-@*”). 


This completes the preliminary results necessary for the proof of our central 
theorem. 


6. The asymptotic formula. From (1.2) the Laurent coefficients a, 
are given by 


(6.1) a, = ri) [ Ge) fr" &. 
c 


where C can be any contour within the domain of regularity of G(z) and 
enclosing z = 0. We shall assume m is a large positive integer. Hence the 


convention of § 4 that a choice of m determines the value r = r(m) = |Z,(m)|, 
where Z;(m) is the featured solution of 
(6.2) H log G(z) = n. 


We may also consider the points z,(r), Z,(m), (2 = 1, 2,..., NW) as determined. 
The contour C is now chosen as follows. 


Contour C. The points in the z plane Z, exp (— ie), Z, exp (te) are joined 
by arcs of the circles |z| = |Z,|. The points Z, exp (— ie) and Z, exp (ie) are 
joined, respectively, to the points z, exp (— ie) and x exp (ie) by straight 
lines. Finally 2, exp (te) is joined to 2,4: exp (— ie) by the arc of the circle 
\z| = || =r. The convention adopted in a previous section concerning 
Zy+1 assures that C becomes a closed contour. Further, the previous work on 
relevant paths also ensures that the contour C does not traverse any of its 
points more than once. 

From Lemma 5.1 the only portions of C that can contribute to the asymp- 
totic expansion of r*a,/M;, (r) are those portions that intersect a relevant 
path. For this reason we may write 


N 
(6.3) r"a,/M;,(r) ~ > I, 
k=1 
where 
(6.4) I, = (24iM,(r))""" f G(z) *” dz 
Ck 


and C;, is the arc of the circle |z| = |Z,| joining Z, exp (— ie) to Z; exp (ie). 
For this reason we study 


(6.5) K, = (2ri)™ f G(z) = **” dz. 
Ck 














ral 








oe 





ASYMPTOTIC BEHAVIOUR OF LAURENT COEFFICIENTS 551 
The substitution z = Z, exp 1@ makes (6.5) become 
(6.6) Ky = (2n)'Z3" f ‘GZ, exp i@) exp(— in®) dd. 
Expanding log G(Z, exp 1@) in a Maclaurin expansion about @ = 0 we have 
(6.7) log G(Z, exp 10) — ind = log G(Z,) — 4 H’log G(Z,) @ 
+ Lo H™ log G(Z,) ia)" /m! 
Hence the substitution 


(6.8) ¢ = (4H? log G(Z,))40 
into (6.6) yields 


(6.9) 2n K, ™ G(Z,)Z," (4 H’log G(Z,))* xc 9%, (log M,(r))™) 
exp (—¢°) do 


where $0, is given by (5.16) and F;,(r, ¢, u) by (5.32). From (5.33) and (5.39) 
we may write 


(6.10) - Far, ¢, (log M,(r))*) exp(—¢"*) do 


> f. exp(—o')(1+ Dy burl, ¢)(log Ma(r))™") do + Rava 


[ en(-# + > bem,x (log Mx(r))~") do + O( (log My(r))~* 


because of (5.34) and (5.40). Further, since bo,.(7,¢) is a polynomial in ¢ 
we include only terms which are exponentially small by replacing ¢o, by 
© exp (i¥,), and without any loss in generality we may consider ¢ to be now 


a real variable and take limits of integration from — © to o. If we let 
An. x(r) be given by 

1 
(6.11) Amal?) = Sz J” exp(— 6") banalr, 6) d6 


then we have proven 
(6.12) [Flr & (log Mi))) exp(—#") de 
—$0.8 
~ vat + S Am,x(log M()*) 


and (6.9) gives 
(6.13) Ky ~ G(Z,)Z;,"(22 H’log G(zyy%(1 + > Am,x(log My) . 
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This in turn gives the complete asymptotic expansion for J, in (6.4) and a, 
from (6.3). Although by assumption G(Z,)/M;,(r) is not asymptotically equal 
to zero, it is possible that 


1 
G(Z,)/Mi(r) = Agta s) ; 


and in such a case J, and J, would be terms of different orders. For this reason 
it is better to leave (6.13) as our major formula and in each specific case 
write out (6.3) after the different orders of the K, have been established. We 
shall illustrate the procedure in the next section. Thus we have established 
the following central theorem of our paper. 


CENTRAL THEOREM. A complete asymptotic expansion of the Laurent co- 
efficients of any generating function G(z) that is admissible can be obtained by 
the formulae established in this section. 


In fact it is possible to show that the first two terms of (6.13) are given by 


G(Zx) 
Zi(2e H log G(Z,))' 


[1 4. 3H'log G(Z,) H'log G(Z,) — 5(H'log G(2))") 
24 (H’log G(Z,))* 


In using these formulae we must examine the solutions of 








(6.15) H log G(z) = n 

and from these we must select the featured path z = Z,(m) and establish all 
of the relevant paths z = Z,(m), (k = 1,2,..., N). The variable r is deter- 
mined by 

(6.16) r = |Z,(n)| 


and all quantities involved in the asymptotic expansion may then be com- 
puted. 


7. Specific examples. Since we have already shown that [exp($(1 — 2*))] 
/(1 — 2) is an admissible function we shall use this generating function as 
our first illustration. From (4.20) and (4.21) we have 


.- * 1 
_ re Zs) l—sa-g tat: 
7.2) a eo ow See Sey 

(7. (n) = bb — ghar... 


Thus 


(7.3) My(r) = 2n' mbon( - wa Se .) = G(Z;). 


o— 
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(7.4) G(Zs) = ge“ ( + ao +... ) ; 

(7.5) H’log G(Z;) = 4n*” (1 - 3) ; 

(7.6) H’log G(Z2) = 4n*” (1 +...). 

(7.7) H'log G(Z;) = 24n* (1 +...). 

(7.8) H'log G(Z;) = 192 n*” (1+...). 
ni+1/4 ( 5 

(7.9) Ki~ Gnu 1+oa+...). 


(—1)" enit0.25 
(7.10) Ka~ “Tena ge See % 


Thus we have an illustration of a case where K; is a different order term 
than K,; and K;, affects only the second term of the asymptotic expansion. 
From (7.9) and (7.10) we find 


ni+1/4 n 
—e 5, (-1)"\1 ) 
a a ~ goa (1+ (5+ GP )ete). 


We have so far in the paper only used admissible functions which have 
Maclaurin expansions. One of the simplest examples of an admissible function 
that has no Maclaurin expansion about z = 0 and does have a Laurent 
expansion is the generating function exp (4x(z — z~')) for the Bessel Functions 
J,(x). We shall assume «x is fixed, real, and positive. To use our method on 
this particular generating function is a little like shooting sparrows with 
cannons. It is of course well known that the series definition of J,(x) will 
act as an asymptotic expansion under these conditions. However it does act 
to illustrate the procedure for examples of similar type but more complicated 
in nature. For this generating function there is only one relevant path 6 = 0. 


n+ (n® — x’)! 


(7.12) r = |Z,(n)| = - 





Hence 

(7.13) M,(r) = G(Z;) = exp [(n* — x*)'). 
(7.14) H'log G(Z;) = (n® — x*)'. 

(7.15) H'log G(Z;) = n. 

(7.16) H*log G(Z:) = (n* — x’)!. 





” x" exp[(n® — x*)*] ( = 2n* + 3x” ) 
(7.17) J, (x) Pn? = x) "In de (nw a x) *}" 1 24(n* — x’)*” + coe fe 
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As a final example we use the generating function exp (z/(1 + 2*)) to 
illustrate the method when the relevant paths are not straight lines. For 
this generating function there are two relevant paths. These are 


a 
(7.18) Zi(n)=i+ sr-qZt-:-::: 

— - 
(7.19) Z:(n) = i+ *t-id...- Z,(n). 


Since 2:(r) = Z,(m), 22(r) = Z2(m) the fact that exp (z/(1 + 2*)) is admissible 
can easily be verified. Hence 


(7.20) G(Z,) = [exp (4(m#(1 + 4)))] (1 +...). 
(7.21) G(Z2) = [exp (4(m#(1 — i)))] (1 +...). 
(7.22) Z:* = [exp (3(n'(1 + i) — inr))] (1+...). 
(7.23) Z: = [exp (}(n3(1 — i) + inx))] (1 +...). 


(7.24)  H® log G(Z;) 
(7.25)  H? log G(Z:) 


2(1 — ijn? (1+...). 
(2(1 + im? (1+...). 


and 
(7.26) ed 2¢"' cos [n* — i(4n — 1)] 


(2x)*(2n)°”* : 





8. Conclusion. The major result of this paper has been to give a set 
of conditions on a complex function G(z) by means of which we can recognize 
whether or not G(z) belongs to our class of admissible functions. If G(z) does 
belong to such a class then our central theorem tells us that the complete 
asymptotic behaviour of the Laurent coefficients can be determined. It would 
be desirable to have results that tell us that certain large classes of functions 
are admissible. For example, it is possible to show that all functions of the 
form P,,(z) exp (S,(z) + Q(1/z)) are admissible providing P,,(z), S,(z) are 
polynomials and Q(1/z) is a regular function of its argument. From our experi- 
ence we would say that there exists a very extensive class of integral functions 
all of which are admissible. Problems of this type would be worth investigation 
so that in specific examples one could tell, almost at a glance, whether or not 
the generating function is admissible. In such cases one can almost use our 
central theorem to write down the asymptotic behaviour of the Laurent 
coefficients. 

In our terminology Hayman’s class of admissible functions required that 
G(z) possess a unique relevant path @ = 0 that contributes to the first term 
of the asymptotic expansion. For this reason he was able to prove results 
stating that the product of admissible functions is admissible. In our case 
the two functions involved might have quite different relvant paths and 
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such a theorem is not possible. For example, the functions exp (z/(1 — z*)) 
and (exp (z/(z? — 1)))/(1 — 2) are both admissible but their product 1/(1 — 2) 
is not admissible. 

When one considers the class of functions for which the method of Darboux 
applies and the class of functions for which the method outlined in this paper 
applies, we now have a very extensive class of functions for which we can 
consider the problem of determining the behaviour of a,, as n> ©, as a 
solved problem. 
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CONVEX STRUCTURES AND CONTINUOUS SELECTIONS 
ERNEST MICHAEL 


Introduction. This paper continues the study of continuous selections 
begun in (13; 14; 15) and the expository paper (12).' The purpose of these 
papers, which is described in detail in the introduction to (13), can be sum- 
marized here as follows. If X and Y are topological spaces, and ¢ a function 
(called a carrier) from X to the space 2” of non-empty subsets of Y, then a 
selection for ¢ is a continuous f: X — Y such that f(x) € (x) for every x € X. 
For reasons which are explained in (13), we restrict our attention to carriers 
which are lower semi-continuous (l.s.c.), in the sense that, whenever U is 
open in Y, then {x € X | ¢(x) (\ U # §} is open in X. Our purpose in these 
papers is to find conditions for the existence and extendability of selections. 

The principal purpose of this paper is to generalize the following result, 
which is half of the principal theorem (Theorem 3.2”) of (13) (and is repeated 
as Theorem I of (12)). 


THEOREM A. Let S be paracompact, Y a Banach space, and © the family 
of closed, convex, non-empty subsets of Y. Then every lower semi-continuous 
carrier &: X — S admits a selection. 


In this paper, the Banach space Y is replaced by a complete metric space 
carrying an axiomatically defined convex structure (Definition 1.1) which 
permits one to take “‘convex combinations’ of some (but not necessarily all) 
ordered n-tuples of points in Y, in a suitably continuous fashion. With convex 
sets defined in the obvious way (Definition 1.4), the generalization of Theorem 
A which is thus obtained is given in Theorem 1.5 (a). The significance of this 
generalization is illustrated by the following example. 


Example A. G is a metrizable group, and H a closed subgroup which is 
isomorphic to the additive group of a Banach space. Convex combinations 
can be taken of n-tuples lying in the same right coset of H. The right cosets 
of H are then convex sets. By applying Theorem 1.5 (a) to this situation, 
it is shown in Corollary 7.3 that there exists a cross-section.? (For G a Banach 


Received May 20, 1958. This paper was written in part on a National Science Foundation 
contract at the Institute for Advanced Study, and in part on a National Science Foundation 
contract at the University of Washington. 

‘Except for § 8, no previous knowledge of continuous selections is necessary to read this 
paper, although some acquaintance with the first three sections of (13) will be helpful. 

*If G is a topological group, H a closed subgroup, u the canonical map from G to G/H, and 
if ¢:G/H —G is defined by (x) = u~'(x), then a cross-section (resp. local cross section) is a 
selection for ¢ (resp. ¢|U for some non-empty open U C G/H). 
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space and H/ a linear subspace, this already follows from Theorem A, and was 
first proved by Bartle and Graves in (2).) 

If the family of sets S, instead of having convex elements, is only what we 
shall call (Definition 1.4) equi-locally convex, then the global Theorem 1.5 (a) 
changes into the local Theorem 1.5 (b). This is illustrated by the following 
example. 


Example B. If Example A is modified only by assuming H to be locally 
isomorphic to the additive group of a Banach space, then only “close together” 
n-tuples in the same right coset of H admit convex combinations, the collection 
of right cosets of H is equi-locally convex, and Theorem 1.5 (b) implies that 
there exists a local cross-section’ (Corollary 7.3). 

Both parts of Theorem 1.5 will be obtained as simple consequences of a 
single Theorem, 1.3, which also generalizes Theorem A. The statements of 
these theorems, as well as the basic definitions, are found in § 1. § 2 contains 
some preliminary results which are not directly concerned with selections. § 3, 
the core of this paper, contains the proof of Theorem 1.3. This proof parallels 
that of Theorem A in (13), by obtaining the desired function f as the limit 
of a uniformly convergent sequence of functions f,, However, while in the 
proof of Theorem A the f,’s are continuous with f,(x) near $(x) for all x, in 
the proof of Theorem 1.3 the f,’s may be discontinuous with f,(x) in $(x) for 
all x; this new approach is necessitated by the inability to take convex com- 
binations of m-tuples in Y not lying in an element of S. The simple derivation 
of Theorem 1.5 from Theorem 1.3 is contained in § 4. 

An important source of convex structures on a metric space Y is provided 
by so-called geodesic structures on Y, which permit one to take convex com- 
binations of certain pairs of points of Y in an appropriate fashion. Geodesic 
structures are studied in § 5, where it is shown (Proposition 5.3) how they 
inductively generate convex structures in a canonical fashion. In § 6, a recent 
theorem of Nijenhuis (17) is used to show that the usual geodesic segments 
on a Riemannian manifold give rise to a geodesic structure, and hence to a 
convex one. 

The last two sections are devoted to applications of Theorems 1.3 and 
1.5 and, except when dealing with Lie groups, are independent of §§ 5 and 6. 
§ 7 deals with locally convex groups, which are, essentially, topological groups 
with an invariant convex structure defined on a neighbourhood of the identity. 
Theorem 1.5 is used to prove a cross-section theorem (Theorem 7.2) for such 
groups, which implies (Corollary 7.3) the results given in Examples A and B 
above, as well as Gleason's cross-section theorem for arbitrary Lie groups. § 8, 
finally, uses Theorem 1.3 to prove a theorem of the covering homotopy type, 
which is then applied to fibre spaces in the sense of Hurewicz (7). 

In conclusion, it should be noted that the results of this paper are of 
interest only if the domain X of the functions to be defined is infinite dimen- 


‘Ibid. 











558 ERNEST MICHAEL 


sional; for finite dimensional X, better results were already obtained in (14) 
and, with reference to § 8, in (15). 


1. Principal theorems and definitions. Throughout this paper, P, 
denotes the unit simplex in Euclidean n-space R"; that is, 


Pea ER <u <1, g=1,....00% naa}. 

i=l / 
If E is any set, then E* will denote the n-fold Cartesian product of E, and 
if i<m, then 0,: E*— E**' is defined by 90,(x1,...,%,) = (x1,...,Xi-1, 
ere 


Définition 1.1. A convex structure on a metric space E with metric p assigns 
to each positive integer a subset M, of E", and a function k,: M,XP, — E, 
such that 

(a) If x € My, then k;(x, 1) = x. 

(b) If x € M, (m > 2) and i <n, then 0 € M,_,; and, for any ¢ € P, 
with ¢;, = 0, k(x, t) = Rai (0x, 94). 

(c) If x € M, (m > 2) with x; = xy; for some 7 < n, and if ¢ € P,, then 
h(x, t) = kas (Ogz, &), where f = (t:,... , tsa, ba, 4 bess, tere, .- - » be) 

(d) If x € M,, then the map t — &,(x, t), from P, to E, is continuous. 

(e) For all « > 0 there exists a neighbourhood V, of the diagonal in EXE 
such that, for all m and all x,y € M,, (x: y:) € V. for i = 1,..., implies 
p(Ry(x, t), Ra(y, t)) < € for all ¢ € P,. 

Note that conditions (a) and (c) together imply that, if x € M, with 
%1 =... = X£, then k(x, t) = x; for all ¢ € P,. 


Definition 1.2. A subset S of a space E with convex structure is admissible 
if S* C M, for all n. If S is admissible, then the convex hulk of S, denoted 
by conv(S), is 


{k, (x, t) |x € S*, t€ P,, nm=1,2,...}. 


THEOREM 1.3. Let Y be a complete metric space with a convex structure, and 
let S be the family of non-empty admissible subsets of Y. Let X be paracompact, 
and o: X — © lower semi-continuous. Then there exists a continuous f: X — Y 
such that 


f(x) € [conv(¢(x))]- 
for allx € X. 


Definition 1.4. Let E be a metric space with convex structure. Then a 
subset S of E is convex if it is admissible and conv(S) C S. A family S of 
subsets is equt-locally convex if there exists an open covering $ of E such that, 
whenever S € © and B € &, then S()\ B is admissible and conv(S(\B) CS. 

Note that every family of convex sets is equi-locally convex. 
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THEOREM 1.5. Let X be paracompact, Y a metric space with convex structure, 
S a family of non-empty, complete subsets of Y, and ¢: X — S lower semi- 
continuous. Let A C X be closed and let g be a selection for |A. Then 

(a) If every S € S is convex, then g can be extended to a selection for ¢. 

(b) If S is equi-locally convex, then g can be extended to a selection for o|U 
for some open UDA. 


No discussion of spaces with a convex structure would be complete without 
mentioning their extension properties. For convenience, let us call a metric 
space E convex if it admits a convex structure making E itself a convex set. As 
a special case of Theorem 1.5, we see that a convex complete metric space 
is an AE (absolute extensor) for paracompact spaces (in the sense of (10)); 
a somewhat better result is obtainable directly as a straightforward general- 
ization of Dugundiji’s extension theorem (5, Theorem 4.1). A more interesting 
fact is a partial converse: Every compact metric space E, which is an AE for 
metric spaces, is convex. To see this, embed E in a Banach space B (8), and 
let F be the closed convex hull of Z in B. Then there exists a retraction r: FE, 
and we define k,: E” X P, ~ E by 


k, (x,t) = (> tas) : 


That conditions (a) — (d) of Definition 1.1 are satisfied is clear, while (e) 
follows from the factthat F is compact (9) and hence r uniformly continuous. 
It is not know whether compactness can be replaced by a weaker condition 
in the above result. 


2. Two lemmas, Our first lemma deals with the following very elementary 
concept. 


Definition 2.1. If X is a topological space, (Y, p) a metric space, and a > 0, 
then a function f: X — Y is a-continuous at x» € X if to every « > 0 there 
corresponds a neighbourhood U of x» such that 


p(f(x), f(xo))<at+e 


for every x in U. Moreover, f is a-continuous if it is a-continuous at every 
Xo EX. 

Note that clearly f is 0-continuous (at xo) if, and only if, f is continuous 
(at Xo). 


LemMa 2.2. Le X be a topological space, (Y,p) a metric space, and let 
fn: X — VY (n = 1,2,...) be @ sequence of functions which converges uniformly 
to a function f: X — Y. Let xo € X, and suppose that to each a > 0 and positive 
integer N corresponds an n > N such that f, is a-continuous at xo. Then f is 
continuous at Xo. 


Proof. Let « > 0. Pick an m such that p(f, (x), f(x)) < 4« for all x € X, and 
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fa is 4€-continuous at xo. Pick a neighbourhood U of x» such that, if x € U, 
then p(f,(x), fa(xo)) < $e. Then, if x € U, 


p(f (xo), f(%)) < pf (xo), fa(xo)) + e(fa(xo), falx) + e(fa(x), f(x)) < 


This completes the proof. 


Our second lemma concerns partitions of unity.‘ In it, as well as later in 
the paper, we denote the set of points where a real-valued function f is positive 
by P(f). An indexed family of real-valued functions {Pe}ae, will be called 
locally-finite if the indexed family of sets {P(pa)}aea is locally finite.® 


LEMMA 2.3. Let X be paracompact, Y a topological space, @: X — 2” lower 
semi-continuous, and {a_}nn1 @ sequence of continuous pseudometrics on Y. 
Then for each integer n > O there exists an index set A,, a locally finite partition 
of unity| Palaca, Om X, points ya(x) € o(x) whenever a € A, and x € P(ha), 
and a map x,: Anyi — A, onto, such that the following conditions are satisfied 
for all n. 

(a) Ifa € A, and x, x’ € P(pa)~, then on( a(x), Ya(x’)) <1. 

(b) If a € Ag, BE ma7*(a), and x € P(pg)~, then on(ya(x), ye(x)) < 1. 

(c) If a € Ag, then pa(x) = Z{pa(x)|8 © m,~'(a)} for all x € X. 


Proof. This theorem follows easily from (16, Lemma 2.1). The statement 
of that result paralleis that of our theorem, with the following differences: 
Instead of a partition of unity {Pa}aca, on X, (16, Lemma 2.1) only provided 
us with an open covering { Ua}aea, of X, the sets P(p,.)~ in (a) and (b) were 
replaced by U,, and (c) was replaced by 


(c’) Ifa € Ay, then U, = U { Us|8 € 2,-'(a)}. 


To obtain our partitions of unity, we proceed by induction, in such a way 
that 


(d) P(pa)— C Ua aé€ A, »=1,2,.... 


For m = 1, simply pick an open covering { Va}aea, of X such that V. C U, 
for alla € A:, and let {p.}aea, be a partition of unity subordinated® to { Va}aea;- 
Suppose we have {Pa}aeca,, and let us construct {s}5.a,,,- For convenience, 
we shall write z for z,. 

Let a € A,. Then {Ug C\ P(pa)~}ger-1(a) iS a relatively open covering of 
P(pa)~ by (c’) and (d), and hence has a relatively open refinement { Vg} g.e-1(2) 
with Vs C Us for all 8B € -"(a). Let {Gs}ser-1(@) be a partition of untiy on 


‘A partition of unity in a topological space is an indexed family {palaea of functions from 
X to (0, 1] such that Zaegpa(x) = 1 for all x € X. 

5An indexed family of sets { Ua}aea is called locally finite if each x € X has a neighbourhood 
intersecting U, for only finitely many a€ A. 

5{ Palaea is subordinated to { Ua\aca if Pa vanishes outside Uz for all a. Every open covering 
of a paracompact space has a partition of unty subordinated to it. 


| 
| 
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P(p.2)~ which is subordinated to { Vg}ser-1a). For each 8 € x~'(a), define 
pa: X — [0, 1] by 


P(x) - x ¢ P(pa); 
Pa(x) = pa(x) . g(x) x € P(p_)~. 


This function pz is well defined, and is continuous on X since it is continuous 
on two closed sets whose union is X. Moreover, {ps}s.a,,, is now clearly a 
partition of unity on X which satisfies (a) — (d). This completes the proof. 


3. Proof of Theorem 1.3. We begin by observing that conditions (c) 
and (e) of Definition 1.1 are equivalent to the following apparently stronger 
conditions (c’) and (e’). In fact, (c’) follows from (c) by straightforward 
induction, while (e’) follows from (e) by means of a standard result (see, 
for instance (11, Corollary 2.3)). 

1.1 (c’). Let m and m’ be positive integers with m < m’, y © Mn, t © Pn, 
y € My, and t’ € P,,. Suppose that, for some order-preserving map 


w:{l,...,m’}—{1,...,m}, we have 
Vs = ew) (= 4... 
t= 2 GM j=l,...,™ 
Then 


Rn(y, t) = Rm (y’, t’). 


1.1 (e’). For all integers » > 0 there exsits a continuous pseudometric ¢, 
on E with the following property: If x, y € M,, for some m, then o,(x;, y;) < 1 
fori = 1,...,m implies p(k,(x, t), kn(y,t)) <2 for all t € Pp. 

The main step in the roof of Theorem 1.3 will be the following lemma, 
whose proof, in turn, rests heavily on Lemma 2.3. 


LemMaA 3.1. With X, Y, and o: X — 2” as in Theorem 1.3, there exists a 
sequence of functions f,:X — Y (n = 1,2,...) such that, for all n, 

(a) fa(x) € conv(¢(x)) for all x € X, 

(b) f, ts 2-"-continuous, 

(c) p(falx), far (x)) < 2 for all x € X. 


Proof. We begin by applying Lemma 2.3, with o, (m = 1,2,...) as in 
1.1(e’) at the beginning of this section. Let A,, Pa, Ya(x), and 7, be as in Lemma 
2.3. Let us also suppose that the sets A, are well ordered in such a way that 
each x, is order preserving; this is easily done by induction. 

For each m and x € X, let 

Aa(x) = a € Ay| x € P(pa)}, 
A, (x) = {a € A,| x € P(p,)-}. 


Note that A,(x) C A,(x), and that both are finite. Let a,...,am be the 
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elements of A,(x) in order. Remembering that ¢(x) is admissible, we can 
now define f,(x) by 


(1) In(x) = Rm ( (Ye; (%), ~~ +» Vam(X)), (Pay (%),- ++» Pam(%))). 


Note that, since p,(x) = 0 if a¢ A,(x), it follows from Definition 1.1(b) that 
f,(x) is not changed by the omission from (1) of terms with index a ¢ A, (x). 

Let us check that our conditions are satisfied. 

(a) This follows from the definitions. 

(b) We will show that f, is 2-*-continuous at a given x» € X. Let 

V = \{P(pa)|a € An(x0)} — UfP(pa)-|a € (An — An(x0)}. 
Then V is a neighbourhood of xo, since {P(p2)}aea, iS locally finite. Now let 
x € V. Then ; A 
An(x0) C An(x) C An(x) C An (x0). 


So if a:,...,@m are the elements of A,(x) in order, we have, by the remark 
following (1) above, 


fa(x) = Rm( (Yas (X),- ++ + Yam(%)), (Par (%), ~~ +» Pam(%))), 
In(%0) = Rm( (Yas (X0), ~ + + + Vam(X0)), (Pay (X0), ~~ + » Pam (Xo))). 
Let 
hy (X, X0) = Rm ( (Yas (Xo), - + » Yam (%0)), (Pas (%), ~~ +» » Pam (%))). 
Clearly 
(2) p(fa(x), fa(xo)) < p(fa(x), hn(x, x0)) + p(hn(x, Xo), fa(x0)). 
Now 


t,% € Pi, ( = 1,..., 9); 
whence 


On(Ya;(X), Ya; (X0)) <1 
by 2.4(a), and hence, by 1.1(e’) at the beginning of this section, 
(3) p(fa(x), n(x, x0)) < 2. 


On the other hand, it follows from Definition 1.1(d) and the continuity of 


the functions .,,..., Pa, that to every « > 0 there corresponds a neigh- 
bourhood W of xo such that 


(4) p(hn(x, Xo), fa(x0)) < « ifx € W. 
Combining (2), (3), and (4), we conclude 
p(fn(x), fa(xo)) < 2"* + € ifxe VOW, 


and hence f, is 2~-"-continuous at x». 


(c) Pick a fixed m and x € X. Letting ai,..., a, be the elements of A, (x) 
in order, we have 


Fn(x) = Rm( (Yay (%), ~~ +» Vam(X)), (Pay (X), ~~~» Dam (X))); 
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letting y1,..., Ym be the elements of A,,:(x) in order, we have 


Fnrs(%) = Rane (Yas (%)s + + + Yom (%))» (Drs (%), « «+ + Pam’ (%))). 


Now let 
Ping a(%) = Rm ((Yoqerr (%), « «+ + Voncrm’ (%)), (Pr (%), « - «+ Pam’ (%)))- 
To prove (c), it will suffice to show that 
(5) fu(%) = Inys(x), 
(6) P(Ufn+1(*), hngi(x)) < 2. 
Let us prove (5). From lemma 2.3 (c) we see that, for each i < m’, x,(y,) = a, 
for some j < m. We can, therefore, define a map m: {1,..., m’} + {1,..., m} 
by 


a(i) =j if, and only if, Rn(V¥i) = a. 
It follows, again from Lemma 2.3(c), that 


Pa;(x) = ) — lips; (x) > © 


ier 


Remembering that x,, and hence 7, is order-preserving, we now use Definition 
1.1 (c’) (at the beginning of this section), with 


+ oe Ya; (x), ty - pa;(x), vi - Ventrs) (X), 
and 
t, = p,,(x) (jf = 1,...,m;i = 1,...,m’), 


to conclude (5). 
It remains to establish (6). But this is easy since 


on (Vy; (x), Ve(y;) (Xx) <1 j _ | er m’ 


by Lemma 2.3 (b), and hence (6) follows from Definition 1.1 (e’) at the 
beginning of this section. This completes the proof of the lemma. 


Theorem 1.3 is now a simple consequence of Lemma 3.1 and Lemma 2.2. 
In fact, the sequence 


{fa}nmt 


in Lemma 3.1 is uniformly Cauchy by 3.1 (c), and hence, by the completeness 
of Y, converges uniformly to an f:X —Y. This f is continuous by Lemma 
3.1 (b) and Lemma 2.2, and Lemma 3.1 (a) implies that f(x) € (conv(¢(x)))- 
for every x € X. Thus f has all the required properties. 


4. Proof of Theorem 1.5 from Theorem 1.3. Let X, A CX, Y, 
S C 2”, :X — G, and g: A —-Y be as in Theorem 1.5. Since all our assump- 
tions remain true if Y is replaced by its completion, we may as well assume 
that Y is complete. 
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(a) Assume that every S € © is convex. Define ¥: X — 2” by 


(1) v(x) = {g(x)} if xE€A 
v(x) =o(x) if x¢A. 


This ¥ is lower semi-continuous by (13, Example 1.3*), and ¥(x) is convex 
and closed for each x € X by the assumptions on ¢ and the remark following 
Definition 1.1. But Theorem 13 implies that such a ¥ must have a selection 
f, and this f is the required extension of g. 

(b) Assume that © is equi-locally convex. Let Y’ = U GS. If & is as in 
Definition 1.4, use (11, Corollary 3.2) to obtain a metric r on Y’, agreeing 
with the topology, such that the family of all r-spheres of radius 1 in Y’ is 
a refinement of 8. We thus have 

(2) If S€ S, and A CS has r-diameter <1, then A is admissible, and 
conv(A) C S, whence (conv(A))- C8 =S. 

Now let E be any Banach space, with metric d, containing (Y’, r) isometrically 
(8), let h: X — E be a continuous extension of g (1, Theorem 1.4), and let 


W = {x € X | d(h(x), o(x)) < 4}. 


Then obviously W C A, and W is open by the lower semi-continuity of ¢. 


Pick an open U C X such that AC UCU C W. Let y be as in (1) above, 
and define 6: U 2” by’ 


O(x) = w(x) 1\ Sy(h(x)). 


This @ is lower semi-continuous by (13, Proposition 2.5). Moreover, by (2) 
6(x) is admissible and [conv(@(x))]- C (x) for all x € X, and by the remark 
following Definition 1, conv(@(x)) = {g(x)} for all x € A. Our conclusion, 
therefore, follows from Theorem 1.3, with X replaced by U, and ¢ by @. 


5. Geodesic structures. In this section, we define geodesic structures, and 
show how they generate convex structures. The closed unit interval will 
always be denoted by J. 


Definition 5.1. A geodesic structure on a metric space E with metric p is 
a function k:M XI-—E (where MC EXE) satisfying the following 
conditions: 

(a) If (x, x) € M, then k(x, x,t) = x for all ¢ € J. 

(b) If (x1, x2) € M, then k(x, x2,0) = x; and k(x, x2, 1) = x2. 

(c) If (x1, x2) € M,t € I, and (k(x, x2, t), x2) € M, then k(R(x1, x2, t),x23s) 
= k(x,, x2, t + s(1 — 2)) for all s € J. 

(d) For all (x, x2) € M, the mapt — k(x, x2, t), from J to E, is continuous. 

(e) For each « > 0 there exist neigbourhoods W, C N, of the diagonal in 
E X E which are small* of order e, such that if (x1, x2) € Mand (y:, yz) € M, 


™We use S,(y) to denote the open r-sphere about y. 
8A neighbourhood U of the diagonal is small of order « if (x, y) € U implies p(x, y) < «. 

















ee 
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then (x;, yi) € N,. and (x2, y2) € W, implies that (k(x, x2, ), R(y1, yo, t)) € Ne 
for all ¢ € J. 

Notice that Definition 5.1 is quite similar to Definition 1.1 with n = 2, 
but that 5.1 (e) is distinctly stronger than 1.1 (e) with m = 2. This extra 
strength is needed to carry out the inductive proof of Proposition 5.3 below. 


Definition 5.2. Let E be a metric space with a geodesic structure. A subset S 
of E is geodesic if, whenever x,, x2 € S, then (x;, x2) € M and k(x, x2, t) € S 
for all ¢ € J. A family S of subsets of E is equi-locally geodesic if there exists 
an open covering & of EZ, such that, if S€ S, A € W, and x, x2 € (SMV A), 
then (x;, x2) € M and k(x, x2, t) € S for all ¢ € J. 

Note that any family of geodesic sets is equi-locally geodesic. 


PROPOSITION 5.3. If E is a metric space with geodesic structure (M,k), then 
there exists a convex structure {M,, Rn}nmi” on E such that 

(a) Every geodesic set S C E is convex. 

(b) Every equi-locally geodesic family of subsets of E is equi-locally convex. 


Proof. We define the sets M, C E" and functions k,: M, X P, —~ E in- 
ductively as follows. First, let 


M, = E, ki (x, 1) = |, 


Suppose M, and k, have been defined, and let us define M,,4; and k,4:. We 
introduce the following notation for the rest of this proof. 
(1) If x € E**', then % € E£" is defined by 


X, =X (¢ = 1,...,%). 


(2) If ¢ € Pass and t.4; ¥ 1, then 7 € P, is defined by 





one C5 PF (4 = 1, »n) 
Now let 
(3) Masi = {x € E**'|® € My, (hn(%, t), Xn41) € M for allt € Py}, 
_ 2Xn+1 Uf tn+i =1 
(4) Rn+1(x, t) as oe f), Xn+1) tn41) if ba+1 x a 


Let us check that the conditions of Definition 1.1 are satisfied. 

1.1 (a). This follows from our definition of :. 

1.1 (b). For n = 2, this follows from the definitions. Suppose that it holds 
for nm, and let us prove it for m + 1. Letx € Mazi, i Cn +1,andt€ Mays 
with ¢; = 0. If i = » + 1, then 0.x = &, which is in M, by (3), and &,4:(x, #) 
= k(k,(%, 2), xna1,0) = k,(%, 2) = k, (8x, 8¢), which is what we had to show. 

Suppose, then, that i ~ m + 1, and let us first show that 0 € M,. By (3), 
we must show that (0,.x)~ € M,-1, and that (R,_1((0)™, 5), Xa41) € M for 
all s € P,-1. Now (0,.x)~ = 0,%, and this is in M,_, by the inductive hypo- 
thesis. As for the other requirement, pick an s’ € P, such that s,/ = 0 and 
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d,s’ = s; then (Ra-1((9x)™, 5) ,Xn41) = (Rn—1(9%, O45’), Xng1) = (Rn (%, 5’), Xn+1) 
by the inductive hypothesis, and this last pair is in M by (3). Finally, let 
t € Mas: with ¢, = 0. If 4.4: = 1, then both sides of the equation in 1.1 (b) 
are X,41, and hence the equation is satisfied. If t,4; # 1, then (by the induction 
hypothesis) 


Rensi(x, t) = R( kn (%, 2), %n41) tats) 

R(R,-1(0 &, ad), Xn+1y bn+1) 
R(Ra-1((9 x) ™, (Od)™, Hust, bros) 
= k, (0x, 0), 


which is what we had to show. 

1.1 (c). In the presence of 1.1 (b), 1.1 (c) is equivalent to the following 
condition, which we are going to verify: If x € M, (m > 2) with x; = xi: for 
somei < n,andif t,t’ € P, witht, = tj forj] # 1,i + 1, thenk,(x,t) = k,(x, t’). 

For n = 2, this follows from 5.1 (a) and the definition of k,. Suppose it 
is true for m, and let us prove it for m + 1. 


Case 1. i < m. In this case, tpi. = toys’. If toga = 1, then Rayi(x, t) = Rass 
(x, t’) = xa1- If t41 ¥ 1, it suffices to show that &,(%, 2) = k,(%, 7’). But 
i, = tf for j ¥ i, i+ 1, so this follows from the inductive hypothesis. 


Case 2. i = n. In this case, x, = X,41, and 
n—1 n—1 
DD 4= DL tH. 
j=l j=l 
If this sum is 0, then Ry+1(x, f) = Re( (xn, Xn), (tn» tngi)) (by 1.1 (b)) = R(xn, Xn, 


tnai) (by (4)) = x, (by (5.1); similarly k,+:(x, t’) = x,. If this sum is not 
zero, then t, + fas1 = te’ + tess’ < 1, and 7 = 7’. Applying (4) twice we get 


Rn+1(x, t) - k(k(Ry-1(%, 2), Xny t, (1 — tnt1)~'), Xny ta+1) 
- k(kp-1(2, i), Xny tn + tn+1) by 5.1 (c). 
Similarly 


Ra+i(x, t’) = k(Ry—1 (2, i’), Xny yy + ta+1'). 


But ft, + tes = th’ + teas’, and 7 = 7’, as already observed. Hence k,41(x, #) 
= ky+1(x, t’), which is what had to be shown. 

1.1 (e). Proof by induction. The result is clear for k;. Suppose it is true 
for k,, and let us prove it for k,4;. Pick a fixed y € M, and s € P,, and let us 
show that the map ¢t — &,+:(y, ¢), from P,4; to E, is continuous at s. 


Case 1. S941 # 1. In this case, for some neighbourhood U of s in Py4:, 
t € U implies t,4: # 1. Hence, for ¢ € U, Raii(y, t) is given by the second 
formula in (4), and the required continuity follows from 5.1 (d) and our 
inductive hypothesis. 
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Case 2. Sn4i1 = 1. In this case Rayi(y, 5) = Yn41. We will show that k,4:(y, 2d) 
is close to Yn+1 if t.41 is close to 1. To do this, let A = {k(y, #)|t € P,} and, for 
each r € I, define f,: A ~ E by 


f(a) = R(a, Yn+1, 7)- 
Then 


Ra+ily, t) = finss(Q:) for somea,€ A if tay: #1 
Ransily, t) = Yn+1 if ba+t = 1. 


It follows that it is sufficient to show that f —/; uniformly as r — 1. That 
f.—fi pointwise follows from 5.1 (d). Moreover A is compact, and {f,},.1 is 
equicontinuous by 5.1 (e). Hence (see, for instance, (3, p. 34, Prop. 14)), 
the topologies of pointwise convergence and uniform convergence coincide 
on {f-}-«x1, and hence f, — f; uniformly as r — 1. 

1.1 (f). For « > 0, let V, = W., where W, is as in 5.1 (e). Let us show 
that this works. For k; this is clear. Suppose it works for k,, and let us prove 
it for Resi. If tog: = 1, then Ragi(x, ft) = X41 and Rasi(y, ft) = Yasir, and we 
know that p(%p41, Yes1) < € Since (X%p41, Yani) © We. If tess ¥ 1, then 


(5) Rana (x, t) = k(k,, (&, f), Xn+1) bn+1) 

Rasily, t) = k(k, (9, h), Yn+1s tn+1)- 
Now, since (x;,¥;) € W, for i= 1,...,"+1, we have (%,9,) € W, for 
i= 1,...,m, and hence our inductive hypothesis yields 


ky (%, 2), Ra(F,#)) € Ne. 
But (Xn41, Yn41) € W, by assumption, and hence, by (5) and 5.1(e), 
(R41 (x, t), Rasily, t)) E Na 


and now the desired conclusion follows from the fact that N, is small of 
order e. 

We have now completed the proof that the sets M, and functions &, defined 
in (3) and (4) are a convex structure on E. To complete the proof, we must 
check conditions (a) and (b) of our proposition. 

5.2 (a). Let S C E be geodesic, and let x € S". We must show that x € M,, 
and that &,(x, ¢) € S for all ¢ € P,. This is clear for » = 1. Let us therefore 
assume it for m, and prove it for m + 1. Now observe that, by the inductive 
hypothesis, ¥ € M, and k(%,s) € Sfor all s € P,. Since S is convex, it follows 
that x € Mas: by (3), and ka41(x, t) € S for all t € Pyoss by (4). 

5.2 (b). Let S be an equi-locally geodesic family of subsets of EZ, and let 
% be as in definition of this concept in Definition 5.2. For each x € E, pick 
an A, € & which contains x, and then pick r(x) > 0 such that S,,2)(x) C Az. 
Let B, = Wrz)(x), with W as in 5.1 (e), and let 8 = {B,},.g. Let us show 
that this 8 works. 

Let S € S, xo € E, and let T = S/\B,,. We must show that if x € 7", 
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then x € M,, and &,(x, t) € S for all t € P,. We will prove this inductively 
and, to keep the induction going, we will also show that, for all n, 


(6) (kn (x,t), x0) € Neep) forall # € Py, 


where NV...) is as in 5.1(e). Now for m = 1, all this is clear. Suppose it is 
true for nm, and let us prove it for m + 1. 
By the inductive hypothesis, ¥ € M, and 


k,(%,s) € (ST) Ag) 
for all s € P,. Since also 
Xnt+1 € (S C\ Ag), 


it follows that x € M,.,; by (3), and k,4,:‘x, t) € S for all t € Pass by (4). It 
remains to check (6) for m + 1. If t,.; = 1, this is clear, since then k,, (x,t) = x,41. 
If t.4:1 # 1, note that 


(R(X, 2), x0) € Noceo) 
by the inductive hypothesis, and 
(Xn+1, X0) © Woeee) 
by assumption. Hence 
(Ry (x, t), x0) = (R(Rn(%, 2), Xno1, tno1), Xo) © Nove) 


by (4) and 5.1 (c), and that is what had to be shown. This completes the 
proof. 

To conclude this section, let us record the following consequence of Propo- 
sition 5.3. 


THEOREM 5.4. Theorem 1.5 remains true if ‘‘(locally) convex’’ is replaced 


by “‘(locally) geodesic’. 


6. Riemannian manifolds. Let E be a Riemannian manifold with Rie- 
mannian metric p. If x1, x2 € X have a unique shortest geodesic joining them, 
this geodesic is called a segment. Let 


(1) L = {(x1, x2) € E X E\x,; and x2 are joined by a segment}. 

If (x1, x2) € L, then the segment from x; to x2 is given by a continuous 
€s:.23: 1 — E. 

Define h:L X I- E by 

(2) h(x1, X2, t) = ge, 2, (#). 


The elementary properties of geodesics imply that (L, h) satisfies conditions 
(a) — (d) of Definition 5.1. In general, of course, 5.1 (e) is not satisfied. How- 
ever we have the following theorem of Nijenhuis (17). 
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THEOREM. Every p € E has a spherical neighbourhood V(p) with the following 
property: 

(a) If x1, x2 € V(p), them (x1, x2) € L, and h(x, xo, t) € V(p) for allt € I. 

(b) If e > 0, and x, x2, ¥1, ¥2 € V(p) with p(x, v1) < € and p(x2, ¥2) < fe, 
then p(h(x1, X2, t), h(y1, ¥2, t)) < € for all t € I. 


It follows immediately from the above theorem that, if M = V(p) x V(p) 
and k = h|M, then (M,jh), in addition to satisfying conditions (a) — (d) 
of Definition 5.1, also satisfies condition (e) (by taking NV, to be the e-neigh- 
bourhood of the diagonal, and W, the 4«-neighbourhood), and hence we 
conclude 


PROPOSITION 6.1. Every point p of a Riemannian manifold E has a spherical 
neighbourhood V(p) such that every pair of points in V(p) is joined by a unique 
geodesic segment, and these geodesic segments generate a geodesic structure on 
V(p) under which V(p) is a geodesic set. 


While the above proposition is sufficient for our application to Lie groups 
in the next section, let us conclude this section by proving the following more 
“global” result. 


PROPOSITION 6.2. On every compact Riemannian manifold E there exists a 
geodesic structure (M, k) such that 

(a) If L and h are as in (1) and (2), then M C Landk = h\M. 

(b) Every p € E has a neighbourhood U(p) such that U(p)K U(p) C M. 


Proof. For each p € E, let S(p) be the open sphere about » whose radius 
is half the radius of the sphere V(p) in Nijenhuis’s theorem. Then {S()},.z 
is an open covering of E, and by compactness there exsits a finite subcovering 
{S(p.)} m1". With L and h as in (1) and (2), let 


M = {(x, y) € L|x,y € S(p,) for somei = 1,...,n} 


(3) k = h|M 


The only requirement that needs further checking is that (M, zk) satisfies 
condition (e) of Definition 5.1. 

Let « > 0, and let us define the required neighbourhoods W, and JN, of 
the diagonal. For each i, let r; be the radius of S(p,), and let 

y = min(min{r,ji = 1,..., }, €). 

Now let 
N. = {(x,y) € E X Elp(x, y) < y}, 
W. = {(x,y) € EX Elp(x, y) < 4}. 


To see that this works, suppose that (x:, x2) € M, (v1, ¥2) € M, and that 
(x1, v1) € N, and (x2, y2) € W,. Since (x1, x2) € M, it follows from (3) that 
there exists a positive integer k < m such that 


X1, X2 © S(px) C V( px). 


(4) 
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Since p(x;, ys) < y < 7, for i = 1, 2, it follows that 
Vi, V2 © Vi py). 


Finally, observe that p(x:, yi) < y, while (x2, y2) < $y. We therefore apply 
Nijenhuis’s theorem to V(p,) to conclude that, for all ¢ € J, p(k(x:, y:, 2), 
k(x2, ¥2, t)) < ¥ < €, which is what we had to show. 


7. Locally convex groups. The reader is reminded that, as elsewhere in 
this paper, A” always (even in a group) denotes the n-fold Cartesian product 
of A for any positive integer m. Moreover, if G is a group, x € G", and z € G, 
then xz will denote (x.z, ...,x,z) € G", while if K C G" and z € G, then Kz 
will denote {xz/x € K} C G". 


Definition 7.1. A metrizable topological group H with right-invariant 
metric d is locally convex if there exists a convex structure {L,, A,},<1° on H, 
and a neighbourhood W of the identity, such that the following conditions 
are satisfied for all n. 

(a) L, = W". 

(b) hk, is right-invariant; that is, if x € W", 2 € H, and xz € W", then 
h, (xz, t) = (A, (x, t))z for all ¢ € P,. 

(c) The left and right uniform structures on H coincide on W. 

(d) Condition (e) of Definition 1.1 is satisfied with V, of the special form 
{(x, y) € E X Eld(x, y) < 4(e)} for some 4(e) > 0. 

If we can take W = H, then H is called a convex group. 

Note that this definition does not depend on which right-invariant metric 
is used, since any two such metrics are uniformly equivalent. 

The following are some examples of locally convex groups. 

(1) A locally convex metrizable topological linear space, with the usual 
convex structure. (This group is, in fact, convex.) 

(2) A Lie group. A Lie group can be made into a Riemannian manifold 
with right-invariant Riemannian metric. By Proposition 6.1, there exists a 
neighbourhood U of e on which the segments generate a geodesic structure 
(M, k) making U into a geodesic set. Since the metric is right-invariant, so 
is k. By Proposition 5.3, this geodesic structure generates a convex structure 
{M,, Rajnei”’ on U which makes U a convex set; hence M, = U". From the 
way k, is defined in terms of & in the proof of Proposition 5.3, it follows that 
each &, is right-invariant along with k. Now let W be a compact subneigh- 
bourhood of U, and let 4, = k,|W”. Then all requirements are satisfied, the 
compactness of W taking care of 7.1 (c) and (d). 

(3) The multiplicative group H, of invertible elements in a Banach algebra 
B with unit e. Let W be the sphere of radius $ about e. For the convex structure, 
use the ordinary linear one. The right and left uniform structures on H 
coincide on W with the uniform structure induced by the norm. More gener- 
ally, one can take any group of the form Hy, (\ (J + e), where J is an ideal in B. 
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(4) A topological group which is locally isomorphic to a locally convex 
group. 
(5) A finite Cartesian product of locally convex groups. 


THEOREM 7.2. If G is a metrizable group, and H a closed, complete (locally), 
convex subgroup, then there exists a (local) cross-section.® 


Proof. Let p be a left invariant metric on G. We shall construct a convex 
structure {M,, k,x}nm:1” on (G, p) such that the collection § of right cosets of 
H becomes equi-locally convex (and such that each right coset of H becomes 
a convex set in case H is convex.) The theorem then follows from Theorem 1.5, 
which applies because the natural map u: G — G/H is open, and hence the 
carrier ¢: G/H — 2%, defined by ¢(x) = u~'(x), is lower semi-continuous (13, 
Example 1.3*). 

By 1.1 (e) and the remark following Definition 1.1, there exists a neigh- 
bourhood W, of the identity e such that, whenever x € W,", then A, (x,t) € W 
for allt € P,. Pick asymmetric neighbourhood W; of esuch that W,. W: C W; 
Now let 

M, 
M, 


Define k,’: M,’ X P, ~G by 
ka(x,t) = (h,(xe™", #)) 2, 


where z is any element of G such that x € W,"z, and let 


Il 


Ul Wy2lz € G}, 
Ul Wt2|z € G}. 


k, = k,|(M, X Py). 


It follows from Definition 7.1 (b) that the definition of &,’, and hence that 
of k,, does not depend on the choice of z. That {M,, R,},.:° satisfies con- 
ditions (a) — (d) of Definition 1.1 follows from the fact that {Ly, Aa}nes” 
satisfies them. It remains to verify that {M,, k,},.:° satisfies condition (e) of 
Definition 1.1. 

Observe first that, for any topological group G and any z € G, right trans- 
lation by z (g — xz) is a uniformity isomorphism for both the right and left 
uniform structures on G. For the right one this is clear. For the left one, 
it follows from the fact that our map is a composition of x — z~'xz and 
y — zy; the first of these is a group isomorphism, and the second one is a left 
translation. 

It follows from the previous paragraph, applied to the left uniform structure, 
that 7.1(c) and (d) imply 


(1) To every z € G and « > 0 corresponds a 5(¢, 2) > 0 such that, for all n, 
if x,y € Wi"z and p(x: y:) < (e,2) for i=1,...,m, then p(k,’(x,?), 
kn’ (y, t) < € for all t € Py. 


See footnote 2, p. 555. 
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Now for each « > 0 and z € G, let 
(2) &(e, 2) = 45(4e, 2). 
(3) V. = {(x, y) € G K Gix, y € See.) (z) for some z € G}. 


Let us check that this V, works. We must show that, if x, y € M,, and if 
for each 1 = 1,...,m there exists a 2; € G such that 


Xiy Vi E Sece.24) (24); 
then 


(4) p(k, (x, t), k,(y, t))<e 


for all ¢ € P,. 

Pick k<m such that £(e, 2) = max,,"t(e,2,), and let a@ = x,~'z, 
b = y,—'z,. Note that, since p is left invariant, p(g, ga) = p(x, 2) < E(e, Z) 
and p(g, gb) = p(yx, 2) < E(e,z%) for all g€G. For i=1,...,m, let 
x; = xa, yi’ = yb. Then 


p(x’, v4) < oxi, x1) + p(x ys) + lyn Vt) < 4E(e, Ze) = FCF, Ze). 


Now x € M,, so x € W;"r for some r € G, and hence x’ € W,"ra. Thus, for 
t=1,...,m”, x € Wera = W2(rx,—")e. C (We. W2)e C Wizz; in other 
words, x’ € W,"z,. Similarly, y’ € W,"z,. We can therefore, apply (1), with 
Z = Z, to conclude that p(k,’ (x’, t), Ry’ (y’, t)) < 4e for all t € P,. 

Now note that, for all ¢ € P,, 


Ri (x', t) = hy(x’sy", t) te = hy(xxz", t) a = k, (x, ta, 


and hence p(k’, (x’, t), a(x, t)) < E(e, 2). Similarly p(k,’(y’, t), Ra(y, 8) < € 
(e, 2) for all ¢ € P,. Hence, for all ¢ € P,, 


(R(x, t), k,(y, t)) 
< phn (x, t), bn(x’, t)) + plkn(x’, t), Rn(y’, t)) + p(Rn(y’, t), Rn(y, #)) 


which is what we had to show. 

To complete the proof, we must show that the family § of right cosets 
of H is equi-locally convex, and that each right coset of H is a convex set 
in case H is convex. The latter assertion is clear. To prove the former, let d 
be a right invariant metric on G, and pick r > 0 such that the d-sphere of 
radius r about the identity e in H is contained in W:. Since x € M, if, and 
only if, x1, ..., 2 are all in some right translate of W2, it follows that x € M, 
whenever %1,...,X, is a subset of some member of § having d-diameter 
<r. Hence for 8 we simply pick the family of d-spheres of radius 47 with 
centres in G. 

Combining Theorem 7.2 with Example (2) yields a new proof—at least in 
metrizable case—of a cross-section theorem of A. Gleason. When combined 
with Examples (1) and (4), Theorem 7.2 yields 
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CoROLLARY 7.3. Let G be a metrizable group, and H a closed subgroup which 
is (locally) isomorphic to the additive group of a complete, metrizable, locally 
convex topological linear space. Then there exists a (local) cross section.'® 


This corollary generalizes a result of Bartle and Graves (2, Theorem 4) 
(see also (13, p. 364)), where G is assumed to be a Banach space and H a 
closed linear subspace. 


8. Homotopy extension, covering homotopy and fibre spaces. A 
common feature of both the homotopy extension theorem (6, Theorem VI, 
5) and the covering homotopy theorem (19, Theorem 11.7) is that, speaking 
very roughly, special conditions on the domain permit one to obtain a con- 
tinuous function globally even though the range is well behaved only locally. In 
this section we obtain a result (Theorem 8.3) on continuous selections which 
seems to incorporate the essential aspects of both these theorems. For finite 
dimensional domains, this was already done in (15, Theorem 3.4), and both 
statement and proof of Theorem 8.3 parallel those of its finite-dimensional 
analogue. After developing the necessary preliminary concepts, we will thus 
be able to dispose of the proof of Theorem 8.3 simply by pointing out the 
obvious modifications which must be made in the proof of (15, Theorem 3.4). 


Définition 8.1. Let E be a metric space with convex structure. Then a 
family S of subsets of E is uniformly equi-locally convex if there exists an 
r > 0 such that, whenever S € S and A CS has diameter < 17, then A is 
admissible and conv(A) C S. 

Note that any uniformly equi-locally convex family of sets is equi-locally 
convex. 


Using Theorem 1.3, we now prove 


PROPOSITION 8.2. Let (Y,p) be a metric space with convex structure, S a 
uniformly equi-locally convex family of complete, non-empty subsets of Y, and 
let r > O be as in Definition 8.1. Let X be paracompact, ¢: X — S lower semi- 
continuous, and suppose there exists a continuous g:X — Y such that p(g(x), 
o(x)) <r for every x € X. Then there exists a selection for 9. 


Proof. Since all assumptions remain unchanged if Y is replaced by its 
completion, we may assume that Y is complete. Define y: X — 2” by 


v(x) = o(x) 1 S,(g(x)). 
Then y is lower semi-continuous by (13, Proposition 2.5) and, for every 


x € X, ¥(x) is admissible and conv(¥(x)) C $(x). Hence Theorem 1.3 asserts 
the existence of a continuous f: X — Y such that, for every x € X, 


f(x) € [conv(y(x))]}- C o(x). 


This f is the required selection, and the proof is complete. 


See footnote 2, p. 556. 
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The previous theorems in this paper only required the carrier ¢: X — 2” 
to be lower semi-continuous. In the following theorem, however, just as in 
[15], we require it to be continuous; that is, given « > 0, every xo € X hasa 
neighbourhood U such that 


$(x0) C S.(O(x)), (x) C S.(G(%0)) 
for every x € U. 
The principal result of this section can now be stated as follows. 


THEOREM 8.3. Let Y be a metric space with convex structure, and S a uniformly 
equi-locally convex family of complete non-empty subsets of Y. Let Z be para- 
compact, X = Z X I, and ¢: X — S continuous. Finally, let B C Z be closed, 
and define A C X by 


A = (ZX {0})U (BX D. 
Then every selection for ¢|A can be extended to a selection for 9. 


Proof. This theorem is identical with (15, Theorem 3.4) except that no 
dimensional restrictions are placed on Z, and that © is uniformly equi-locally 
convex instead of uniformly equi-LC". The proof of our theorem is similarly 
identical with the first part of the proof of (15, Theorem 3.4) (the second part 
takes care of a dimensional difficulty), provided “uniformly equi-LC”” is 
replaced by “uniformly equi-locally convex,” and the reference to (14, Theorem 
9.1) in the proof of Lemma 3.3 of (15) is replaced by a reference to our Propo- 
sition 8.2. This is all that need be said, and the proof is thus complete. 


Before continuing, let us observe that if every S € © were actually convex 
in Theorem 8.3, then this theorem would be a special case of Theorem 1.5(a). 
Note also that if ¢ is a constant map, then Theorem 8.3 is simply a homotopy 
extension theorem, while if B is empty, we get a theorem of the covering- 
homotopy type. 

As mentioned at the beginning of this section, Theorem 8.3 is valid because 
of the special relation of X to A C X. However, the relation need not be 
quite as special as all that. In fact, it is an easy consequence of Theorem 8.3 
that it is sufficient to assume that X is paracompact, and that A is a generalized 
deformation retract of X in the following sense: There exists a continuous 
r:X X I-A such that r(x,0) =x if x € X, r(x,1) € A if x € X, and 
r(x,t) € A if x € A and ¢ € I. The simple proof of this generalization from 
Theorem 8.3 can be omitted, since it is identical with the proof of how Theorem 
6.2 follows from Lemma 5.3 in (15). 

Just as (15, Theorem 3.4) was applicable to fibre spaces in the sense of 
Serre (18), Theorem 8.3 is applicable to fibre spaces in the sense of Hurewicz 
(7). These are, by definition, triples (EZ, », B), where E and B are topological 
spaces with »: E — 8B continuous and onto, satisfying the following con- 
dition: If Z is paracompact, k: Z X I — B continuous, and if ¢:Z KX I— 2 
is defined by ¢(z, 4) = p~'(R(z, t)), then every selection for ¢|Z XK {0} can 
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be extended to a selection for ¢. The following result is now an immediate 
consequence of Theorem 8.3. 


Coro.iary 8.4. Let E, B be metric spaces, p: E-—» B continuous and onto, 
and suppose that 

(a) each p~'(x) is complete, 

(b) {p-"(x)} ecm ts uniformly equi-locally convex, 

(c) the carrier y: B — 2*, defined by (x) = p-'(x), is continuous. 





Then (E, p, B) is a fibre space in the sense of Hurewicz (7). 
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A NOTE ON HYPERCONVEXITY IN 
RIEMANNIAN MANIFOLDS 


ALBERT NIJENHUIS 


1. Summary. Let M denote a connected Riemannian manifold of class 
C*, with positive definite C? metric. The curvature tensor then exists, and 
is continuous. 

By a classical theorem of J. H. C. Whitehead (1), every point x of M has 
the property that all sufficiently small spherical neighbourhoods V of x are 
convex; that is, (i) to every y,z € V there is one and only one geodesic seg- 
ment yz in M which is the shortest path joining them: f:[0, 1] ~ M,f(0) = y, 
f(1) = 2; and (ii) this segment yz lies entirely in V: f((0, 1]) C V; (iii) if f is 
parametrized proportional to arc length, then f(#) is a C* function of y, ¢, 
and z. 

Let V be a convex set in M; and let y;, yo, 21, 22 € V. Let fi, fe: (0, 1] ~ V 
denote the geodesic segments ,2:, yoz2, each parametrized proportional to 
arc length. Then for each ¢ the points f,(t), f2(t) are called corresponding points 
of the geodesic segments 2, yot%2. In particular, y:, y2 are corresponding 
points; and so are 2, 22. 

The distance between points x and y, denoted by p(x, y), is the greatest 
lower bound of the lengths of rectifiable paths joining x and y. The diameter 
D(A) of a set A is, as usual 


D(A) = op p(x, y). 


Let V be a convex open set of M, and y a positive number. Then V is 
called y-hyperconvex if for every positive number e and any geodesic segments 
121, Yo%2 in V, the inequalities p(y., yz) < €, p(z:, 22) < ye imply that corre- 
sponding points of 121, y2t2 have distance less than «. Clearly, y has to 
satisfy y < 1. 

If V is y-hyperconvex, and W is a convex subset of V, then also W is 
y-hyperconvex. If V is y-hyperconvex, and if 0 < 7’ < y, then V is also 
+'-hyperconvex. 


THEOREM I. Every point of a Riemannian manifold has a }-hyperconvex 
neighbourhood. 


Received June 2, 1958. Research supported by an Office of Naval Research Contract at the 
University of Washington. 
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This theorem*® is a corollary of Theorem II, which makes use of the concept 
of maximal curvature, defined as follows. 

If uw and » are tangent vectors at some x € M, with components #, »* 
(x, A, w, ¥)..., = 1,...,) with respect to some local co-ordinates; and if 
Koy, are the covariant components of the curvature tensor Ki,, then 
K(u,v) is defined as — Kay, uv uv’. When u and v are perpendicular unit 
vectors, K(u,v) is the sectional curvature, which clearly depends only on the 
2-plane at x, spanned by u and »v. If A isa subset of M, then k(A), the maximum 
curvature of A, denotes the least upper bound of all numbers K (u,v), with 
u,v being perpendicular unit vectors spanning all 2-planes at all points of 
A. k(A) may be +o. 

Our main result is the following theorem. 


THEOREM II. If the maximum curvature k = k(V) of a convex open set V 
in M is non-positive: k < 0, then V is 1-hyperconvex. If k > 0, and if the dia- 
meter D = D(V) is such that kD*® < 4*/4, then V is y-hyperconvex, where 
y = cos Dk"?, 


The case k < 0 is a much weakened formulation of results obtained by 
H. Busemann (3, Theorems (36.4) and (36.17)); the first of which states 
that the distance between corresponding points of geodesics in a G-space is 
a convex function of the linear parameter on the geodesics. Since, however, 
the case k < 0 is naturally included in our line of argument, the reader will 
find a new proof for this result. 

Every sufficiently small spherical neighbourhood V of a point x € M is 
convex and has compact closure, which implies that k(V) is finite. Since 
k(V) is non-increasing when the radius of V tends to zero, it follows that 
k(V)D(V)? tends to zero as the radius of V approaches zero. This proves 
Theorem I, assuming Theorem II. 

The proof of Theorem II is based on estimates of the solution of systems 
of linear differential equations whose prototype is y’’ = f(x)y. 


2. Estimates on certain systems of linear differential equations. 


LemMA 1. Let K(s) be a continuous family of linear transformations in E", 
with origin O, defined in the finite closed interval 0 < s < 1. Let y(s) bea solution 
of y + K(s)y = 0 in the interval (0, 1], and let S denote the unit sphere around 
O. Then there is a C* path x(s) on S such that for each s € [0,1], the points O, 
x(s), y(s) are collinear (x(s) is called a spherical image of y(s)). 


*This statement was conjectured by E. A. Michael in a slightly weaker form obtained by 
replacing 4-hyperconvexity of a convex set V by the following property: to every « > 0 there 
is a 8 > O such that if 1, ys, 21, 22 € V, p(y1, ¥2) < €, p(2:, 22) < 8, then the distance between 
corresponding points of ¥:, 21, ¥2%2 is less than «. This conjecture was submitted by Dr. Michael 
to a number of mathematicians, including H. E. Rauch, who vouched for it, and L. W. Green, 
who obtained a written proof (spring, 1957) of the weaker result which one gets by permitting 
8 to be equal to zero. Theorem I, in its present form, has been applied in (2) to the theory of 
continuous selections. 
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Proof. lf y = 0 the lemma is trivial, because any C* path x(s) on S isa 
spherical image. If y(s) # 0 for all s € [0,2] the proof is simple because 
x(s) = y(s) . |y(s)|"' is a spherical image. Now assume that y(s) has the 
zeros $1, S2,..., but y # 0. The number N of these s, must be finite, because 
at any accumulation point s’ one has 


y(s’) = y(s’) = 0; 
hence y = 0. Consider 


z(s) = y(s)(s — si)~'(s — s2)7'... (s — Sy) 


for s # s1,...,5y, and 


z(s,) = lim z(s) = y(s,) [] (si — s)™. 


885 ji 


Then z(s) # 0 for all s € [0,7] because 
y(s,) #0. 


Every spherical image of z(s) is a spherical image of y(s); and thus the 
problem has been reduced to a previous case provided z(s) is of class C? for 
S$ = S;,...,5y. In verifying this, the following simple application of the 
mean value theorem is helpful: “Jf a function f is continuous in [a, 6], differ- 
entiable at all points of |a, b] except some c € [a, b], and if 
lim f’(s) = L 

exists, then f'(c) exists and equals L; whence f' is continuous at c’’. To show 
that z(s) is of class C? at s, it suffices to show this for z,(s) = y(s)(s — s,="'. 
We use de I’Hopital’s Rule: 





lim (s) = tim Zs) — $d Ze) .. tm a = — 3K(s,)y(s,) = 0. 
Hence, z;(s,) = 0, and z, is continuous in a neighbourhood of s, in [0, /]. The 
procedure is repeated for Z;, and one thus finds z,(s,) = — $K(s,)y(s,); and 
Z,(s) is continuous at s;. Hence, z(s) exists and is continuous in (0, /]. 


LemMA 2. If K(s) is a continuous family of linear transformations in E"; 
0<s <1, and m is an upper bound for the inner product (u, K(s)u) for all 
s € [0, 1] and all unit vectors u; if y(s) is a solution of 


y + K(s)y = 0, 


and if x(s) is a spherical image of y(s); with y(s) = d(s)x(s); then d satisfies 
a differential equation \(s) + o(s)A(s) = 0, where $(s) is continuous in s; 
0<s<l; and $(s) < m. 


Proof. Since y(s) and x(s) are of class C*, and x(s) # 0; A(s) is also of 
class C*. Substituting y = Ax into y + K(s)y = 0 one finds Ax + 2hx + dx 
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+ Kx = 0. Take the inner product with x, remembering (x,x) = 0 and 
(x,x) + (x,x) = 0. Thus follows \ + {(x, Kx) — (x,x)}A = 0. Hence, 
¢(s) = (x, Kx) — (x,x) satisfies, and is continuous. In case y # 0, ¢(s) is 
unique. Furthermore, @ < (x, Kx) < m. 


LemMMA 3. If K(s) is as before; m as in Lemma 2, then ml* < x* implies 
that no non-trivial solution of y + K(s)y = 0, with y(0) = 0 has a zero in the 
interval 0 < s < l. Furthermore, for all s € (0,1), every solution with y(0) = 0 
satisfies 


sin m's 


lys)\i< say lO), m>0, 
and 
¥SI<jiyO|, m<o. 


Proof. There is no restriction in assuming m > 0 and y # 0. Every zero 
of y(s) is a zero of ly(s)| = +A(s). A satisfies X + ¢\ = 0; A(O) = 0, with 
@ < _m. The function ¥(s) = asin ms if m > 0, and ¥(s) = Bs if m= 0 (a 
and 8 any non-zero numbers) are solutions of » + my = 0, with ¥(0) = 0. 
The smallest positive zero of the first function is rm, which does not lie 
in [0, 1] because xm-! > (mi?)im-) = 1. Thus neither function has a positive 
zero in the interval [0,/]. A well-known Sturmian theorem states that every 
positive zero of A is preceded by one of ¥, which means, in this case, that A 
has no positive zero in [0, /]. In order to prove this, suppose that \ has positive 
zeros in [0,/]; let s’ be the smallest one. One may assume without loss of 
generality that A(s) >O and y¥(s) >0 for s € [0,s’]; then A(s’) = 0, 
Ks’) < 0, ¥(s’) > 0. Further, m — ¢ is strictly positive in some sub-interval 
of (0, s’]; otherwise m = @ in [0,s’]; and y is a multiple of A; hence 
¥(s’) = A(s’) = 0, which is false. We have now 


iy — aviv = [Gv — avy as = [avin — 6) as > 0, 
and also 
Av — Av|o = As’) o(s’) — ACs") Vs’) 
— 40) ¥0) + 0) 40) = A"V(s") <0. 


Thus the assumption that A has zeros in [0,/] leads to a contradiction. This 
proves the first statement of the lemma; the second part uses the same formu- 
las. The function \/y is defined and of class C' for all s € [0,1]; for s = 0 by 
limit procedure. Take A > 0 in [0, /], and 


sin sm? 
sin Im 


¥(s) = 


respectively; then A(/) = ¥(J). For any s, with 0 < s <1, we have 


a2 = 0), ¥(s) = 50) 
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= [rvem - 4) >0. 


Since ¥(0) = 0, we have y¥*(0)(A/¥)o = 0; and thus follows (A/y)’ > 0 for 
all s, and \/y¥ is non-decreasing. Consequently, 


(3),<@),- 8 


that is, A(s) < ¥(s). This completes the second part. 


“= iy — ay 
0 


Lemma 4. For every value of t in the interval 0 < t < 1 the function y; defined 
by ¥(t,a) = sin fa/sina when a #0, and by y(t,0) = t when a = 0, is a 
monotone non-decreasing function of a in the interval 0 < a < x. 


Proof. The cases t = 0 and ¢ = 1 are trivial. Take 0 <t << 1;0 <a < gz, 
then 
dv(t,a) oN 


da sin’ a’ 





where 


N = tcostasina — sin facosa = sin (1 — t)a — (1 — #) cosfasina 


> sina(1 — t) — (1 — #) sina = a(1 — 2) ja i — Ges _ sas «| > 0, 
(1 — t)a a 
because sin a/a is a decreasing function of a in the interval 0 < a < x. Hence, 
dy/da > 0 when 0 < a < =; since y(t, a) is continuous in a at a = 0, it is 
monotone non-decreasing in a in the interval 0 < a < =. 


3. Geodesic segments connecting a fixed point with the points of 
another curve. We return to the geometric situation of § 1. In this section, 
the results are true under the assumption kD* < r*. Not until the proof in 
§ 4 do we need the stronger condition kD*® < r?/4. When k = 0, D = +, 
the expression kD? is interpreted as zero. 


Lemma 5. In a convex set V in M, whose diameter and maximum curvature k 
satisfy RD® < x’, let x be any point, and y, (0 < r < 1) a C' path. The geodesic 
segment xy, is parametrized by t from 0 to 1 proportional to arc length: (t, r) 


f(t, r); f, r) = x, f(l, r) = y-. If 


v(t,r) = ne 


denotes the tangent vector to the path r — f(t, r), for fixed t; and |, denotes the 
length of xy., then 


sin tl,k” 
et, r)1< sin 1,k'”* 





v(1, r)| 











or 


a 


the 
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if k > 0; and 
lv(t, r)| < tv(1,7r)| ifk <0. 


Proof. lf s denotes the arc length function on xy,, measured from x to y,, and 
§/és denotes covariant differentiation with respect to s, then the components 
v* of v satisfy the classical Jacobi equation 

2 


6 « Y gs, 
502” +v Kyi thm = 0, 


where 7 is the unit tangent vector to xy,. Choose along xy, a self-parallel 
orthonormal moving frame (¢:,..., é,), with i = e;. Denote the components 
of v with respect to this frame by y = (v',...,v"), and let K(s) be the 
linear transformation 


y — K(s)y = (o’K i) - ,v Kit), 
where now all components of Kj," are taken with respect to (e:,..., é,). Then 
y + K(s)y = 0; with, of course, y(0) = 0. We claim that m = max {k, 0} is 
an upper bound for the i inner product (u, K(s)u), with |u|] = 1. Decompose u 


into u’ along i, and uw” perpendicular to i. Then 


(u, K(s)u) = Koa,w Pru’ = — Ky, u'?w? = K(u, i) 
= K(u”,i) < |ju”|.k < ju”’|.m < m. 


Since 1, < D, we have ml,? < x’, and thus, by Lemma 3, we have 


sin sm} 


lv(s)| < S575 IG), vs) <7 lyd.)| 


sin l,m 


respectively for k = m > 0 and m = 0 (k < 0). In terms of the parameter 
t = s/l, this is precisely the statement of the lemma. 


Lemma 6. Let V, k, D, x, y,, f(t, 7) be as in Lemma 5, but let yoy:: rf (1, r) =9- 
be a geodesic segment of length L, parametrized by + proportional to arc length. 
Then 


: ‘ 
o(f(t, 0), f(t, 1) < 2K 1, (ft, 0), f(t, 1)) < tL 


respectively in the cases k > 0 and k < 0. 


Proof. We have p(yo, y:) = L, |v(1, t)| = L; and p(f(t, 0), f(t, 1)) is majorized 
by the length of the path r > f(t, r), r € (0, 1]. The latter equals f ,'|o(t, r)|\dr. 
In case k = m > 0, we have |,k! < Dk! < x. Lemmas 4 and 5 (with a = /,k 
and a = Dk) are now applied: 


Ft). D) < fot, rrlde <7 f SE de < BK 7, 





o(f(t, 0), f(t, 1)) < Lf we 


(cases k > 0 and k < 0 respectively). 











582 ALBERT NIJENHUIS 


4. Proof of Theorem II. Choose a positive number «. Let y:, ys, 21, 22 3 V; 
and p(y1, ¥2) < €, p(1, 22) < ye, where y = cos Dk! if k > 0, and y = 1 if 
k < 0. If ¢, x, ¥: (0, 1] — V are parametrizations proportional to arc length 
of the geodesic segments 121, Y221, Yot2 respectively, then by Lemma 6 we 
have, for k > 0, Dk? < 4x: 

p(o(t), H(t)) < p(o(Z), x¢)) + o( (4), x(@)) 
. ‘ . D 
<9 LGA sy Se 
< sin (1 - t)Dk* + ¢cos Dk sin tDk* 
sin Dk’ 
sin Dk* cos tDk* — cos Dk' sin tDk* + cos Dk' sin tDk' 
. sin Dk* 








ecos tDk' < «; 
and for k < 0: p(¢(t), ¥(t)) < (1 — de + te = «. This completes the proof. 


Final Remark. The lemmas derived in §§ 2 and 3, can be used to deduce 
more results on upper bounds for the distance between corresponding points 
of geodesic segments in convex sets with either kD* < x? or RD* < }x*. The 
approach to take seems so obvious that there is no need here to amplify this 
point with a number of examples. 
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THE a-LOCAL OPERATOR PROBLEM 
LOUIS DE BRANGES 


Let f(t) = Se**du(x) be the Fourier transform of a Borel measure of finite 
total variation. The formula 


f(t) = fet*ix du(x) 
can be justified if the integral on the right converges absolutely. For 








fe + bh) — fH _ f we = 1. 
h = jo a * eG) 
where 
e* — 1 >. = 
ich a3| <* 











Now let 4 — 0 in both sides of this equation and use the Lebesgue dominated 
convergence theorem. 

The formula suggests a concept of ‘‘derivative’’ for such absolutely con- 
vergent Fourier transforms. Notationally it is easier to drop a factor of 4. 
Define an operator H to act on f(t) by 


H-f(t) = fet*x du(x) 
whenever 


S\x du(x)| < @. 


The operator H corresponds to — i times differentiation and we have the 
formula 





H-f(t) = tim + HL 


whenever f(t) is in the domain of H. In (2), this formula was shown to have 
interesting consequences from the point of view of the operational calculus. 

Let K(x) be a complex valued, Borel measurable function of real x. Define 
a corresponding operator K(H) on Fourier transforms 


f() = Jetdu(x) 
of Borel measures of finite total variation by 
K(H) -f(t) = fe**K (x)du(x) 
whenever f\K (x)du(x)| < @, 


Received July 11, 1958. This work was started at Cornell University under the supervision 
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For example, if K(H) = H’, 


H?-f(t) = fet*x? du(x) 
= Jet*x(x du(x)) 
= H-Hf(t) 


and H? is the double iteration of H. Similarly, if K(H) is any polynomial 
in H, the operator K(H) may be thought of as a finite linear combination 
of iterations of H. The functional notation K(H) which is so appropriate 
in these simple cases is also to be used when K(x) is only restricted to be 
Borel measurable. 

Let a > 0. The operator K(H) is said to be a-local if whenever f(t) in the 
domain of K(H) vanishes in [— a, a], 


K(H)-f(0) = 0. 


The problem is to find conditions on the defining function K(x) that the 
associated operator K(H) be a-local. 

The problem here is related to the problem in (2) of finding local operators 
on Fourier transforms, and was proposed by Pollard at the same time as the 
other problem. We continue with the ideas and: notation of the previous 
paper, except that Fourier transforms of not absolutely continuous measures 
are admitted in the domain of the operator. The defining function K(x) is 
assumed to have a finite value for each real x, and there are no a.e. identifi- 
cations. 

Recall the lemma of (2) which is the basis of our approach. 


LemMaA 1. If K(z) is an entire function of exponential type such that 


(1) lim r~log|K (re) | < alsin 6}, 
and 
(2) e~*"|K (ty) (g = x + ty) 


ts bounded, and if 
f(t) = Se*du(x) 


is an absolutely convergent Fourier transform in the domain of K(H) which 
vanishes in |— a, a], then for all complex z 


fees = 338) ay) = 6. 
s—s% 
Theorems I and II are the appropriate restatements of Theorem I of (2). 


THEOREM I. If K(z) is an entire function of exponential type satisfying (1) 
and (2), then the operator K(H) is a-local. 


THEOREM II. Jf K(H) is an a-local operator on Fourier transforms and if 
K(H) has in its domain a function which vanishes in |— a, a] and does not 
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vanish identically, then K (x) is the restriction to the real axis of an entire function 
of exponential type satisfying (1) and 


(3) fee ay ee 


1+? 


The gap between Theorem I and Theorem II arising from (2) has caused 
difficulty. The hypothesis (2) cannot be entirely removed from Theorem I. 
In fact, the function K(z) = zsin az does not define an a-local operator on 
Fourier transforms. This follows from the following uniqueness theorem. 


THEOREM III. If K (A) is a-local and if 
+a 
D (1+n")"|K(nx/a)| < @, 


then K (x) is the restriction to the real axis of an entire function K (2) of exponential 
type satisfying (1) and 

(4) eI" K (ty) = o(y) 

as |y| > @. 

Although Theorem III shows that the hypothesis (2) cannot be entirely 
removed from Theorem I, we conjecture that it can be removed if some 
hypothesis is made which keeps the modulus of K(z) from being too bumpy. 
We have been able to obtain the conclusion of Lemma 1 from a hypothesis 
of this nature. 


Lemma 2. If K(z) is an entire function of exponential type satisfying (1) and 
(3) and 


log” |K (¢)| 
6) 22 fs +7 0S 


is bounded above in the upper half plane y > 0, and if 
SO) = Setdu(x) 


in the domain of K(H) vanishes in [— a, a], then for all complex z 
fren k = Xfi du(t) = 0 


We have not been able to show that under the hypotheses of Lemma 2, 
the operator K(H) is a-local. The trouble lies in the following property of 
a-local operators which we have been unable to relate to any explicit descrip- 
tion of K(z). 


THeorEM IV. Jf K(z) is an entire function of exponential type such that 





(6) lim r~log K (re”) = alsin 6|, 


and if the operator K(H) is a-local, and if u is a Borel measure on the real line, 
of finite total variation, such that 
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J\K@du(t)| < @ 
and for all complex z 


fre — K(z) du(t) = 0 


t—2z 
then 
fK@dy(t) = 0. 


It is interesting to compare the hypotheses of Lemma 2 with the following 
alternative hypotheses for Levinson’s theorem (Theorem II of (2)). 


THEOREM V. If K(x) > 1 is a continuous function of real x such that log K (x) 
is uniformly continuous and 


(7) log K (x) de oo 


1+x , 
then there is no non-zero Borel measure uw such that 
J\K(x)du(x)| < ©, and fet*dy(x) 
vanishes in an interval. 


Proof of Theorem I. Let f(t) = Se**dyu(x) be in the domain of K(H) and 
vanish in [— a, a]. By Lemma 1, 


froma = fFO=*9 ne 





_ (tK(t) — 2K(z) 

it J S<—s du(t) 

_ (tK(t) — ze*K(z)e"** 

a J t—z du(t). 


Since 


atten eg * 2. ag @* 
O= Je “*'du(t) = | ———*— an, 


2 


froma - fke — eK (s) 5-014, (4), 


t—z 





Let z = iy where y — + @. By the Lebesgue dominated convergence theorem, 


f(t — 2) eK (te ‘du (t) 30 
f(t — 2)- te-“‘du(t) +0 


and since by (2) e“*K(z) remains bounded in the limit, 
K(H)-f(0) = [K(du(t) = 0. q.e.d. 
Proof of Theorem II. By hypothesis there is a function 
Hilt) = Se*dus(x) 





ng 


nd 
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in the domain of K(H) which vanishes in [— a@,a] and does not vanish 
identically. Write 


filt) = fet™By(x)dyu(x) 


where u is a non-negative Borel measure and B,(x) is a Borel measurable 
function of absolute value 1. The hypothesis that f,(#) vanishes in [— a, a] 
implies that the closed span in L'(uz) of the functions e**, — a <t <a, is 
not all of L'(u). Since f,(¢) is in the domain of K(H), K(x) is in L'(z). The 
hypothesis that K(H) is a-local implies that for every Borel measurable 
function B(x) which is essentially bounded with respect to wu, such that 
fe*B(x)du(x) = 0 for —a <t<a, we have f K(x) B(x)du(x) = 0. Since 
every continuous linear functional on L'(u) is defined by a Borel measurable 
function which is essentially bounded with respect to u, it follows by the 
Hahn-Banach theorem that K(x) lies in the closed span in L'(yz) of the 
functions e**, —a < t <a. Let 


M(z) = sup |L(z)| 
where L(z) ranges in the finite linear combinations of the functions e‘“* where 
—a<t<a.Since L(z) = (fay) appears in the supremum, M(z) > (fdu)-. 


Since the closed span of the functions e**, — a < t < a, is not all of L'(y), 
it follows as in the proof of necessity of Theorem I of (2) that 


(8) JS moa<s 


and 
(9) log M(x + iy) <aly| +2 f REO, wt (y # 0) 


and that K(x) is equal a.e. with respect to u to the restriction to the real 
axis of a unique entire function F(z) such that for all complex z 


(10) F(2) < M(2)f[|K()|du(2). 

Let »’ be any non-negative Borel measure of finite total variation such 
that uw < wp’ and S\K@ldy’ () < o, and let M’(z) be the corresponding 
majorant for u’. Since for any Borel function L(t), 

S\LO|du® <S\LO\du', 
we have for all complex z, 
M'(z) < M(z). 


Therefore, the closed span in L'(u’) of the functions, e'* —a qt <a, is 
not all of L'(u’) and K(x) is equal a.e. with respect to y’ to the restriction 
to the real axis of the entire function F(z), and for all complex z 


F(z) < M'(2)f|K()\du' (0). 
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By the arbitrariness in the choice of uw’, K(x) = F(x) for all real x. (1) and 
(3) now follow from (8), (9), and (10), q.e.d. 


Proof of Theorem III. Let x; and x2 be any two distinct real numbers, 
neither of which is an integral multiple of r/a. Let u be the measure which 
has mass (sin ax2)~' at x2, mass — (sin a@x,)~' at x,, and for integral m has 
mass 

(—1)" (axe — ax;) 
(ax, — nw)(ax, — nt) 





at n/a. It is easy to verify that Sidu(x)! < o and fet*dy(x) = 0 for 
—a<t<a. (Use the Fourier series for e'* in —a <t <a: 
e** ~ X(— 1)"(ax — nw)" sin ax e”“*.) 
Let K(x) satisfy the hypotheses of the theorem. They imply that f\K(@®du() 
< o, Since the operator K(H) is a-local, JK (x)du(x) = 0. Equivalently, 
K(x2) — K(x:) _ (—1)" (ax; — axz) K(nx/a) 


sinadx, sin ax; (axe — nm) (ax, — nr) 








Define K(z) for complex z in the unique way such that the same formula 
holds with x; and x2 replaced by complex variables z; and ze. Obvious estimates 
from this formula show that K(z) is an entire function of exponential type 
satisfying (1) and (4). (A similar formula is discussed in (1, pp. 220-1.) 


Proof of Lemma 2. Let f(t) satisfy the hypotheses of the lemma and let 
K(t) — K( 
(11) L(z) = peeks du(t). 


As in the proof in (2) of Lemma 1, L(z) is an entire function of minimal 
exponential type. Define 


log S(x) = logt|K(x)| 





i ly| log S(t) 
log S(x + iy) = - G-sP +e" (y # 0) 


The convergence follows from (3) and by (5) 
(12) S(x + ty) < e!"'S(x). 
Let the real number ¢ be held fixed and consider 
K(t) — K(z) 
5—s 

as a function of complex z. Then 
K(t) — K(z) 

'-—? 
where by Boas (1, p. 93), 


< lyf (IK(@®| + |K(e)|) 








\K(z)| < S(z)e*!"!. 
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Since S(t), S(z) > 1, we have 
ko ~ K(s) 
t — 


Zz 


< 2\y|* S()S(z)e*™'. 





Since 


K(t) — K(z) 
t—z 


has the same growth properties as K (z), the same reference to Boas shows that 


K(t) — K(x) 
t—<x 








< 2\y|~? S(t) S(x + 2iy)e™*"™' 
< 2\y\~ eriete) lvl S(t)S(x) 
by (12). Choosing y = (2c + 2a), we have 





(13) x) — Kes) < 4e(c + a) SSR) 
and hence by (11) 
(14) |\L(x)| < 4e(e + a)S(x)||Sul| 


where 


\|Sul| = fS@|\du(| < @. 


We claim that for m = 1,2,3,... 


(15) \L(x)|" < [4e(e + a)||Syl\|)"S(x) 
and hence (by the Boas reference) 
(16) \L(z)|" < [4e(c + a)||Su||)"S(z). 


We prove the inequality (15) by induction on m, starting with (14) when 
n = 1. Suppose that for some m, (15) holds, and we will prove the corre- 
sponding inequality with m replaced by m + 1. Since L(z) has minimal ex- 
ponential type, so has L"(z), and by (15), f L"(t)du(t)| < ©. By Lemma 1, 


fEO=2@ aw =0 


for all complex z. Therefore, 








= fF ae = Ret (t) du(t). 


By (16), 
\L*+1(z)| < |yl—*[|Z"(2)|f|K (Odu()| + |K(2)|f|L"(@du(0)|] 
< 2\y|-"[4e(c + a)||Sul| J" S(z)e*!""||Syl|. 
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The conclusion 
L"*"(x) < [4e(c + a)||Sul|]"*'*S(x) 


now follows in the same way we established (13). This completes the inductive 
step. 
Since in (16), # is arbitrary, for all complex z, 


IL(z)| < 4e(c + a)||Syl|. 


By Liouville’s theorem, L(z) = L is a constant. 

Consider various cases depending on the number and location of the zeros 
of K(z). Discard the case that K(z) has only a finite number of zeros, for it 
follows by Lemma 1. In this case, K(z) = P(z)e”* for some polynomial 
P(z) and —~a <h <a. If |h| < a, Lemma 1 can be used directly. For the 
limiting cases, let h + a or — a and use dominated convergence. 

If K(z) admits an infinite sequence of zeros z, = x, + ty, such that 
\z,| — © and y, is bounded away from 0, then 


L= tim f £O du(t) = 0 





by dominated convergence. 
We are left with the case that K(z) admits an infinite set of zeros, but all 
so close to the real axis that the last argument does not apply. Let h > 0. 
By (12), K(t+ ih) is dominated on the real axis by S(t) and the above 
argument with K(z) replaced by K(z + ih) shows that 
K(t + th) — K(z + th) 
t—s 





du(t) 


is a constant, independent of z. Since K(z + ih) has an infinite set of zeros 
at a positive distance from the real axis, the constant is 0. On letting h ~ 0 
and using (12), we have by the Lebesgue dominated convergence theorem, 


K(t) — K(z) oa 
fees s =a du(t) = 0. 


Proof of Theorem IV. Let K and yu be as in the hypotheses of the theorem, 
and let 0 < hk < a be held fixed. As in the proof of Lemma 1, 


thz 


e™* — 
Li) = f == 
(z) pag Ht) 
is an entire function of exponential type. By the hypotheses on K(z), 


K(s)L(z) = fieoao ee du(t) 





t—s 
, tht thz 
- fee =! K(t) du(t) 


and hence 
|K(2)L(z)| < |» (\K(2)| + e™)||u 









ve 


ll 
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where |/z|| = S\du < «. The hypothesis (6) now implies that 
lim r~' log |K (re) L(re“)| < a|sin 6). 


By applying the theorem (1, p. 116) on the effective asymptotic behaviour 
of these functions and using (6), we see that L(z) has minimal exponential 
type. The theorem is applicable since L(z) is bounded on a line 


|L(z)| < 2\y|-"| ||| 
and K(z) is the ratio of two functions bounded and analytic in a half plane. 
By (1, p. 83), the entire function L(z), being of minimal exponential type 


and bounded on a line, is a constant. By dominated convergence 
e™* = e™ 

L(s) = L= lim | —— dul) = 0. 

vata hoes vy 


Again by dominated convergence, 


tht 
f eau = lim —iy | ——dp(?) 








Voto = ty 
” Sand 
= lim —i — dyl(t 
Vato 4 = vy ul 
= 0. 


A similar argument shows that 
f(h) = Je*'du(t) 


vanishes when — a < h < 0. By the continuity of the Fourier transforms, 
this function vanishes in — a < hk < a. Since the operator K(H) is a-local, 


[K(@du(t) = K(H)-f() = 0. 


Proof of Theorem V. Let a > 0 and let T(x) = 7,(x) = sup |L(x)| where 
L(z) ranges in the entire functions of exponential type, satisfying (1) and 
(2), and such that for all real x, |L(x)| < K(x). By the proof of Theorem II 
of (2), the conclusion of the theorem follows if we can show that 


log T(x) 
- I+ 
(for every a > 0). By the uniform continuity of log K(x), there is some 
¢ > 0 such that 


= @o 


llog K (x1) — log K(x2)| < « 
whenever |x. — x;| < 2/2a. Let the real number x» be held fixed. For every 
positive integer n, 

log K(x) > log K(xo) — me 


whenever |x — xo| < mx/2a, and we will use this estimate with so chosen 
that 
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n (log x/2 + €) < log K (xo) 


(if such an m exists). For then the function 


r ™m . a(x — =) 
L(x) = | Dace —— sin rs 


is an entire function of exponential type, satisfying (1) and (2), and when 
le — xo| < nx/2a, 





log |L(x)| < log x/2 < log K(x) 
and when |x — xo| > nx/2a, 
log |L(x)| < 0 < log K(x). 
So L(x) is one of the test functions in the definition of T(x). Since 
log L(xo) = n log x/2, 
we have 
log T (xo) > 2 log x/2 
whenever 
n( log r/2 + €) < log K(x). 
So, for all real x, 


log x/2 : 
log T(x) > Ee log K(x) — log «/2. 


The hypothesis (7) now implies (17). 
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ON A THEOREM OF HAYMAN CONCERNING 
QUASI-BOUNDED FUNCTIONS 


P. B. KENNEDY 
1. If f(z) is regular in |z| < 1, the expression 


m(r,f) = J tos" if(re™*)| do (0<r<1) 


is called the characteristic of f(z). This is the notation of Nevanlinna (4) for the 
special case of regular functions; in this note it will not be necessary to discuss 
meromorphic functions. If m(r, f) is bounded for 0 < r < 1, then f(z) is called 
quasi-bounded in |z| < 1. In particular, every bounded function is quasi- 
bounded. The class Q of quasi-bounded functions is important because, for 
instance, a ‘‘Fatou theorem” holds for such functions (4, p. 134). It is known 
that f € Qdoesnotimplyf’ € Qin general; the simplest counterexample seems 
to be that given in (1), §8. Hayman (1, Theorem IV) found extra hypotheses 
under which f € Q implies f’ € Q. The purpose of this note is to construct 
examples which shed some light on the ‘“‘goodness’’ of these hypotheses. The 
method is similar to one used in (2); it is essentially a combination of a con- 
struction due to Littlewood (3, §8.5) with a theorem of Specht (5) on con- 
formal mapping. 


2. Let D be a domain containing a sequence of open arcs 


(2.1) sme, 6,<0<6,+4, #22 1,2,3,.... 


Let d(@) denote the distance from e“ to the boundary of D, and suppose that 
there are positive numbers A and B, independent of @ and n, such that 


d(6) > A{(@ — 6,)(6, + 5, — 0)}7 (6, <0< 6, + 5,," = 1,2,3,...). 


Then, following Hayman (1), we say that D properly contains the sequence of 
arcs (2.1). Hayman has shown that, if D contains |\z| < 1 and properly contains 
a sequence of non-overlapping arcs (2.1) such that 


(2.2) S 4, = Se, 
(2.3) Y 4 log < ©, 
1 n 
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and if f(z) is regular and bounded in D, then f'(z) is quasi-bounded in |z| < 1. 
This is a very special case of Theorem IV of (1), but it is sufficient for our pur- 
pose in this note. 

We prove the following theorem, which sets a limit to the extent to which 
it may be possible to relax the condition (2.3). 


THEOREM. Let {5,} be a non-increasing sequence satisfying (2.2) and 


(2.4) lim sup (= i,) log = = @, 


and let {0,} be such that no two of the arcs (2.1) overlap. Then there exist a domain 
D containing |z| < 1 and properly containing the sequence of arcs (2.1), and a 
function f(z) regular and bounded in D such that m(r, f’) ~ © asr—1. 


(2.4) does not follow from 





(2.5) 7. 5, log = = ©; 
1 n 
for example, if 
- 1 
n(log n)* 


for all large enough n, then (2.4) is false although (2.5) is true. Thus there 
remains a gap between the positive information given by Hayman’s theorem 
and the negative information which comes from the result proved in this 
note. I have not found a method of closing this gap. However, our result 
shows that (2.3) cannot in general be replaced by 


@ 1 a 
Xu i(1og 1) < @ 


for any a < 1, or even by certain rather stronger conditions than this. 

In the theorem, the sequence of arcs (2.1) may be obtained by putting 
6, = 0, O41 = 8, + 5,, in which case (2.1) is a sequence of arcs whose end- 
points have no limit-point except z = 1, and whose complement is the se- 
quence {e*»}. But it should be noted that a more complicated situation may 
also occur, in which the arcs (2.1) form the complement of an uncountable 
set. 

Some remarks on the other hypotheses of Hayman’s theorem are made in 


§6. 


3. In this section and the next we assume only that (2.1) is a sequence of 
non-overlapping arcs; the other hypotheses of the theorem are not needed at 
this stage. However, we suppose, as we may without loss of generality, that 
6: < 0, < 0; + 2x for every n. Let c > 0 be aconstant, and form = 1, 2,3,... 
put 


v (6) = c(@ meg 6,)* (6, + 5, _ 6) (6, <60< 6,, + 5,). 
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Moreover put ¥(@) = 0 for all other values of @ in 6; < 6 < 6, + 2x, and de- 
fine ¥(@) outside this interval by requiring it to have period 2%. Then we have 


LemMaA 1. There is an absolute constant K with the following properties: 
(i) for all @ and n, 
¥(0) < Keo — 8,)*, 


(ii) ¥(@) is twice continuously differentiable, and for all 6, 
lv’ (0)| < Ke, iW" (0)| < Ke. 


A full proof of Lemma 1 involves some lengthy repetition, and so we leave 
some details to the reader. Let the subset S of (6, 6: + 22) be defined by 


S= U (6, 6, + 5, ), 
1 


and let JT denote the complement of S with respect to the closed interval 
(@;, 0: + 2x). We first prove that, for @ € T and all @, 


(3.1) ¥(0) < Kic/é — ¢)', 


where K; is an absolute constant. If 6 € S, then 06, < 6 < 6, + 4, for some j, 
and therefore, since @ € 7, 
\@ — o| > min (0 — 6,, 6, + 6, — 8). 
Since 
(3.2) 6—6,<2n, 06,+56, —@ < 2x, 
it follows that 


1 
|@ — ¢| > 5, (6 — 6s); + 8; — 8). 


From this and the definition of ¥(@) we get (3.1), with a suitable K,, for all 
6 € S; and (3.1) is obvious if @ € 7. Further, a simple argument from the 
periodicity of ¥(@) shows that (3.1) must then be true for all @. In particular, 
putting @ = 6,, we get (i) with K = K;,. 

From (3.1) it follows that ¥’(¢) = 0 for all @ € 7. For 0, < @ < 6, + 6, 
we have 


¥' (0) = 3c(0 — 0,)?(0, + 5, — 0)?(20, + 5, — 26). 


We can now carry out an argument similar to that used above, with y’(@) 
instead of ¥(@), to find that 


ly’ (0)| < Kecl@ — o|? (@ € T, all 8), 
where K; is an absolute constant. It follows easily that y’(@) is everywhere con- 
tinuous and satisfies |y’(@)| < 4%*K.oc. Thus the part of (ii) which refers to 
¥’ (8) is proved with K = 42°K». 
The required properties of ¥’’(@) are proved similarly. Lemma 1 then follows, 
for some sufficiently large absolute constant K. 








596 P. B. KENNEDY 


Let the domain D be the interior of the simple closed contour r = exp ¥(6) 
in the z = re**’-plane. Then we have the following lemma. 


LEMMA 2. D contains |\z| < 1 and properly contains the sequence of arcs 
(2.1). 


It is obvious that D contains |z| < 1. Let d (6, 2) denote the distance bet- 
ween e” and exp{y(@ + h) + i(@ + &)}. To complete the proof of Lemma 2 
it is plainly enough to show that there is a number A > 0, independent of 8, 
h, and n, such that 





for all @ and h satisfying 
(3.4) A, <0<0,+56,, 0<O+h < 6, + 6. 
By periodicity we may suppose 
(3.5) h| < x. 

When (3.4) is true, we have 

} 
{d(6,h)}* = {exp y (0 + hk) — 1}* + 4 sin*4h exp ¥(6 + h). 
Thus by (3.5), 
d(0,h) > 2\sin 4 h| > 2x-"|hl, 

and if 
(3.6) |h| > (4)—"(0 — 0,)(0, + 5, — 9), 


then (3.3) follows from this with a suitable absolute constant A, say A = Ao. 
On the other hand, if (3.6) is false, then by (3.2) 


|h| < (6 —6,), |h| < $(6, + 5, — 8). 
Therefore 
d(6,h) > expy(6@+ h) —1>y(6+ h) 
> c(@ _ 6, — \h )*(6, + bn -@=— \h|)* 
> 2-*c(@ = 6,)* (8, + 5, = 6)*, 





and this is (3.3) with A = 2-*c. Thus in any case (3.3) is true with A = min 
(Ao, 2-*c), and this proves Lemma 2. 


4. We continue in the notation of §3. We now suppose that the constant c, 
occurring in the definition of ¥(@), is chosen so small that 


81 
(4.1) 1 < exp ¥(@) < 80’ 


’ i 
(4.2) lv’(6)| < 30° 














9) 


"cS 
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pF v' (0) — W' (Go) | 
(43) 2s J_, | sin §@—6) |” < 80° 
(4.4) exp ¥(0) <2-—r for |@— 6,|;<l—r e534 § ees 


the integral in (4.3) being uniformly convergent with respect to 4) for 0 < @ 
< 2x. The possibility of such a choice of c is easily seen from Lemma 1. 

Let w = w(z) map D one-one and conformally onto |w| < 1, with w(0) = 0 
and w’(0) > 0. We then have the following lemma, due to Specht (5, Theorem 
III). 


LemMA 3. (4.1), (4.2), and (4.3) together imply that w'(z) exists* for all z 
in the closure of D and satisfies 
(4.5) lw’(z) — 1| < 4. 


Lemma 3 follows at once from Specht’s result if, in that result, we inter- 
change the z- and w-planes, write exp ¥(@) instead of p(@), and pute = 6 = 1/80. 
It should be noted that Specht actually gives a bound for |z’(w) — 1|, where 
z(w) is the inverse function of w(z); but from this one may readily derive 
(4.5), which is more suitable for our purpose. 

From Lemma 3 and (4.4) we now deduce 


Lemna 4. For |@ — 0,| < 1 — r we have 


4(1 —1r) <1 — |w(re™)| < 3(1 — 1). 


First let z(w) be the inverse function of w(z), as above. If re“ = 2(pe**), 
then 
1 —r < |2(e)| — |2(pe*)| < |z(e) — 2(pe)|. 
This is less than 2(1 — p) by the first mean-value theorem, since |z’(w)| < 2 
by (4.5). Thus for all 6, 


1 — |w(re*)| = 1 —p > $(1 — 7). 
On the other hand, we have 
1 — |w(re“)| = |wlexp{y¥(0) + 10}]| — w(re")| 
< |wlexp{y(6) + 10}] — w(re)| 


< 3 {exp ¥(0) — r}, 


by the first mean-value theorem, since |w’(z)| < 3/2 by (4.5). Hence by 
(4.4), for |@ — @,| <1 —r, 


*If z is on the boundary of D, then by w’(z) we mean lim {w(t) — w(z)}/(¢ — 2) ast 2 
with ¢ in the closure of D. 
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1 — |w(re)| < ; (2—r—r) =3(1—71). 
This proves Lemma 4. 


5. In this section we make use of all the hypotheses of the theorem. Let 
D be as in §3, so that, in view of Lemma 2, the proof of the theorem will be 
complete when we have shown that a function f(z) with the required proper- 
ties exists. 

Choose a sequence of integers {m,} such that, ask > @, 


(5.1) er 


’ 
5m, +1 


(5.2) (x s,) log ‘a ©, 
Mk mk 
Such a choice is possible by (2.4). Put 


(5.3) %, = Fa 


where the square brackets denote the integer part, and let 


(5.4) f(s) = . r{w(e)\”, 


where w(z) is as in §4. Since w(z) is regular in D and |w(z)| < 1 there, it follows 
that f(z) is regular and bounded in D. 


Let 
(5.5) mn=1- a 
and denote by E, the union of the set of arcs 


z=r,e", |0@ —6,| < dn, (s = 1,2,3,...). 
When z € E, we have, by Lemma 4 and (5.5), 

$6, <1 — |w(z)| < 35,,. 
Hence by (5.4), when z € E,, 


w(z)f’(z) 








= |S rate 


w’ (z) 
k-1 o 
> k*m|w(z)|"* — » j'n,\w(z)|"* — > J n,\w(z) |” 
k—1 xo 

> k*m(1 — 38m)" — Do sy ’ny— DO F'n — 43m," 
that is, 

w(z)f’ (z) a ~~. a 
(5.6) | w’ (z) > k n,(1 3 5m,) > >: ’ 

















et 
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By (5.3), 
(5.7) (1—38,)"—7¢° (ko). 
Moreover, by (5.1) and (5.3), 
(5.8) > = O(1)(k — 1)-*m_, = 0(k*m). 


Further, the simple inequality 
(1 — x)" < (nx)-?, 
which holds for 0 < x < 1 and m > 0, gives 
(1 — 48m,)"? <405°8n2 = O(nz*n?), 
by (5.3). Hence by (5.1) and (5.3), 
E = O(nh) Ye 503" 
= O(k *ninzss), 


and so 


(5.9) do: = o(k-*m). 
Combining (5.6), (5.7), (5.8), and (5.9), we get 
jeter 


w' (z) 





> 2nk~*m, 


for all large enough k, where 7 > 0 is a constant. But by (4.5) and the fact 
that |w(z)| < 1 we have |w’(z)/w(z)| > 4, and so 


(5.10) \f’(z)| > nk-*m, (z € E,) 


for all large enough k. 
By the monotonicity of {5,} and the definition of E,, the union of the 
sequence of non-overlapping arcs 


z= r,e",0,<0< 0, + 5,," = m,m + 1,m+ 2,..., 
is contained in E,. Hence the angular measure of £, is at least 
Li 4,. 
Mk 
But 
, 1 , 
m(r,, f’) > a f tog" . Lf’ (ree) | 8, 


the integral being taken over the set of @ in (—72, ) such that r,e € Ey. 
Hence by (5.10), 
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m(rz, f’) > 3. (= s,) log (nk™*n,), 


for all large enough k. It is easy to deduce from this, with (5.1), (5.2), and 
(5.3), that m(r,, f’) ~ © ask — o. Since m(r, f’) is an increasing function of 
r (4, p. 8), it follows that m(r, f’) + © asr—1. This proves the theorem. 


6. It is easy to see from Theorem III of (2) that the condition (2.2) is 
essential for the truth of Hayman’s theorem, and that some “‘order-of-contact 
condition,”’ of the kind occurring in the definition of “properly contains,” is 
also necessary. It is possible to construct examples giving more complete in- 
formation about these hypotheses, but this does not seem worthwhile. 


Professor Hayman has sent me the following remark. “As Professor A. A. 
Goldberg has kindly pointed out to me, Theorem III of (1) is in fact contained 
in a more general result of R. Nevanlinna (Acta Soc. Sci. Fennicae, 50, no. 12 
(1925)), and this paper is also in other ways related to (1).” 
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SOME PRINCIPLES UNDERLYING THE CONSTRUCTION 
OF MEASURES 


CHARLES A. HAYES, Jr. 


1. Introduction. Measures may be obtained from suitable non-negative 
valued functions in a number of ways. It is the purpose of this paper to 
present an abstract formulation of certain principles which may be used to 
construct measures, and to show that the various methods most frequently 
encountered in the literature are in fact all special applications of these 
principles. 

The basic requirements Al—A4 are set forth in § 2, and it is there shown 
that a measure can be defined when these are fulfilled.* These requirements 
are satisfied in every case known to the writer. Usually, however, conditions 
stronger than Al-—A4 hold, and it is these extra restrictions which yield 
information on the class of measurable sets and other matters. In §§ 3, 4, 
and 5 certain abstracts form of such restrictions are considered, and results 
are derived thereforom. The paper concludes with an analysis of how a 
number of measures occur as special cases of the theory. 

We begin by stating a number of definitions and conventions which will 
be used throughout the paper. 

If § is any family of subsets of a set S, we agree to let \/ § denote the 
union of %; that is, the set of all points which belong to at least one member 
of §. For a sequence of sets A we sometimes use the notation ,.,"A, to 
denote the union. If A and B are sets, then by A — B we shall mean the 
set of those points which are in A but not in B. We agree to denote by U(x) 
the set whose sole member is x. We further agree to let 0 denote the null 
set as well as zero. 

When discussing metric spaces, the terms sphere, closed, open, Borel, etc., 
will be understood to have their customary meanings with reference to the 
metric of the space. If A and B are subsets of a metric space, the diameter 
of A will be denoted by diam A, with the convention that diam 0 = 0, and 
the distance between A and B will be denoted by dist (A, B). We keep in 
mind that inf0 = © and empty sums are zero. 

We shall say that the function measures S if, and only if, 


0< 4(A)< d o(8) 
ped 


Received September 10, 1958. 

*These principles were formulated as an outgrowth of an observation by A. P. Morse, 
who noticed that a certain measure occurring in (2), discussed in 6.3 of the present paper, 
satisfies conditions which are special cases of the above. 
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whenever § is a finite or countably infinite family and A C U § CS. This 
definition is equivalent to the usual properties, namely ¢(0) = 0; ¢(A) < ¢(B) 
whenever A C B CS; and 


(US) < } 4(8) 


Bed 


whenever § is a finite or countably infinite family and U § C S. 
The set A C S is said to be ¢-measurable if, and only if, ¢ measures S and 


o(E) = (A ME) + o((S — A) NE) 


whenever E C S. 

A family of sets § is said to be a ring if, and only if, (EU F) € § and 
(E — F) € § whenever E € § and F€ §. 

A family of sets § is said to cover a set A if, and only if, A C U §. 

We say that F is a blanket if, and only if, F is such a function that for x in 
its domain: 

(i) S is a metric space; 

(ii) x € Sand F(x) is a non-vacuous family of subsets of S; 

(iii) diam B < © whenever 8 € F(x); 

(iv) infger (diam (8 U U(x)) = 0. 

If F is a blanket with domain A, then ,.4 F(x) is called the spread of F. 


2. The general theory. At this point we introduce certain basic sets and 
functions which we shall understand to be fixed throughout the paper, along 
with the properties Al—A4 given below. In § 6, we will give certain examples 
which may serve to furnish motivation for the work to follow. 

We let S and § denote, respectively, a non-empty point set and a non- 
empty class of subsets* of S. It will turn out that S is our measure space. We 
let T denote another non-empty set which may or may not bear any relation 
to S. 

We further fix a function M with the following properties: 

Al. For each set A C S and each point ¢ € T, M(A,?) is a collection 
(possibly empty) of finite or countably infinite subfamilies of ¥o. 

A2. 0 € M(O, #) for each ¢ € T. 

A3. M(B, t) C M(A, t) whenever A C BC Sandt € T. 

A4. If ¢ € T and § is any finite or countably infinite collection of subsets 
of S, then there exists a function H on § to T such that if Q is any function 
on § with Q(v) € M(y, H(y)) whenever y € §, then 


U Qyr) € M(UG, #). 
ve 
Finally, we fix a non-negative finite-valued function f whose domain is 


*One could take > as a class of subsets of a space other than S without affecting the 
validity of any results of § 2. 
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$o. With f we associate the function f whose domain is the class of all subsets 
of S, so defined that whenever A C S, 


NA) ~ pte 


=. ’) % Jo ; 


We propose to investigate some of the properties of f. 
2.1. Lemma. If ACS, «>0, and t € T, then there exists a family ® € 
M(A, t) for which 
LX fla) < f(A) +« 


Proof. Evidently 
inf Do fla) <f(A); 
' aD 


Goma. ¢ 


hence there exists a family R € M(A, #) for which, as required, 
> fle) < ( inf > f(a)) +e<f(A) + 
ait _— 


2.2. THEorem. f measures S. 


Proof. From A2 above we clearly have 


O<ginf LD fe) < L fla) =0 
«M(0, 0) a) ae0 


for each ¢ € T; consequently f(0) = 0. 

If AC BCS, then from A3, we see that for each ¢ € T, 

O<ginf LY fe)<gini LD fla) <f(B), 
Gmu.o G Gm. G 

whence it follows that 0 < f(A) < f(B). 

Finally, we consider a finite or countably infinite family $ of subsets of 
S. Given « > 0, we select any convenient positive-valued function 9 with 
domain § for which 


(1) D 2) <«. 
vO 


If ¢ € T, we may use Aé4 to select a function H on § to T, enjoying the 
properties therein specified. Further, we may use Lemma 2.1 to find such 
a function Q that for each y € §, 


(2) Q(y) € M(y, H(y)), 2» f(a) < f(y) + ay). 


We let & = Veg (7). From (2) and the choice of H under A4 we infer that 
(3) R € M(U §, db). 
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Thus, using (1), (2), and (3), we see that 


waitly,. % M0) < E se < E (Zs) 
< d Uy) +27)) < E flv) +e 
1D ae 


From the arbitrary nature of both ¢ and e« in this last relation, we conclude 
that 


(US) < ¥ fo), 
1D 


thus completing the proof of the theorem. 


3. A specialization of the general theory. In order to obtain further 
specific information concerning f we shall observe the effect on f when tee 
functions M and f are subjected to certain restrictions in addition to those 
already imposed. 


For the remainder of this section we shall assume that T is directed by < 
and that the following holds: 


A3’ .M(B,t:) C M(A,t2) whenever AC BCS,4€ 7, € T 


and tle < hh. 
Clearly, A3’ is a stronger restriction than A3. In this case it is easy to see 


that for any set A C S, f(A) may be expressed as a Moore-Smith limit. For 
any t € T, we let 


f(A) =, inf DD fle); 
aD 


Gamu.o 
and then it follows that 
f(A) = sup f(A) = limf,(A). 
eT t< 


3.1. Derinition. If & is any family of sets and B is any set, then by © © B 
we shall mean that family of sets &’ for which y € @’ if, and only if, y = a f\ 8B, 
where a is some member of &. 


3.2. Derinition. If A CS, then we shall write A = S — A. 


We are now ready to formulate a condition on M which bears on the 
nature of the class of f-measurable sets. 

A5. If BE %, ACS, and t€T, then GOBE M(ANB,?2) and 
@ © B € M(A C/G, 12) whenever G € M(A, 2). 

This involves possible restrictions on the class o. 


3.3. DEeFIniITION. We shall say that f is weakly quasi-additive on its domain 
to tf, and only 7f, 








— ry 


nm 
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(i) a\B € Fo and a(\B € Fo whenever a € Fo and BE Fo; 
(ii) f(a) > f(a \ B) + f(a \ B) whenever a € Fo and BE Fo. 


3.4. THEOREM. Jf M satisfies A5 and f is weakly quasi-additive on Fo, then 
each member of %o is f-measurable. 


Proof. We consider any set A C S, arbitrary members 8 and ¢ of %> and T 
respectively, and an arbitrary member § of M(A,?#). Then, using A5 and 
Definition 2.3 we clearly have 


(1) > fle) > ¥ fans) + ¥ fens 
| al et) 


> + f+ E¥ fy) 
ye) Op 7 DOB 


>, inf > flv) +e, inf > fly) 
G & G . Par) 


«M(AN§, 1) ad M(aN§, 1) 


= f(A 8) +fdA 18). 
From the arbitrary nature of § in (1) we obtain at once 


(2) F(A) >f. (ANB) +f. (A O8). 


Since ¢ is arbitrary in (2), we obtain at once from the theory of Moore-Smith 
limits that 


f(A) = lim f(A) > lim f(A A 8) + lim f(A 2 8) 
> .> > 
= f(A 8) + f(A N 8B), 
so that @ is f/-measurable and the proof is complete. 
4. Another form of specialization. We consider now another speciali- 
zation of M and §> which is encountered in some applications. In such cases, 


f is not subjected to any requirements over and above those of § 2, except 
as the specialization just mentioned affects its domain §o. 


4.1. Derrnition. The element ¢ € T is said to be an ¢-approximator for 
(A) if, and only if, A C S and 


f(A) < X fla) +« 


whenever G € M(A, 2). 


4.2. Lemma. If « > 0, A CS, and f(A) < @ then there exists some t € T 
which is an ¢-approximator for f(A). 


Proof. It follows from the definition of f in § 2 that there exists some ¢ € T 
for which 
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f(A) <ginf YD fle) +e 
M(A, 0 a) 


from which it is clear that ¢ is the desired element. 

A6. There exists a relation R whose members are certain ordered pairs of 
subsets of S, with the following properties: 

(i) whenever A’ C A CS, B’ C BCS, and (A,B) € R, then (A’, B’) 
€ R; 

(iu) if ACS, BCS, CCS, (A,C) € R, and (B, C) € R, then (A UB, 
C) € R; 

(iii) whenever f(A) < ~, f(B) < ~, (A,B) € R, and e > 0, then there 
exist elements ¢, ?’, t’’ of T such that ¢#’ and ?¢” are, respectively, e-approxi- 
mators for f(A) and f(B); in addition, whenever @ € M(A U B, #) then there 
exist two disjointed families G’ and G@” with G’ € M(A,?’), G@” € M(B, t”), 
such that @’ U @” C G. 


4.3. THEOREM. If M and § satisfy A6 and (A, B) € R then f(A UB) = 
F(A) +F(B). 

Proof. Since f measures S by Theorem 2.2, we need prove only that 
F(A UB) >f(A) +f(B). Since this inequality clearly holds if either f(A) 
or {(B) is infinite, we may restrict ourselves to the case where F(A) and f(B) 
are both finite. We take « > 0, determine elements #, #’, t’’ of T in accordance 
with A6 (iii), and note that 


(1) Saint Dd fla) < f(A UB). 


«M(AUB, 1) a 
We select G € M(A U B, 2) so that 
(2) Saint Dd fle) > d fla) —«. 


«M(AUB, 1) aD a 
Applying A6 (iii) we determine two disjointed families @’ and @” with 
@’ € M(4,?’), G@” € M(B, t”), @’ U G” CG, put (1) and (2) together and 
use the nature of the choices of ¢’ and ¢”’ to obtain 
(3) f(AUB)> & fla) -« 
aD 
> d fle) + L fla) -« 
ac’ ac’ 

> (f(A) — ©) + YB) — ©) —€ = f(A) +f(B) — 3c. 

The arbitrary nature of ¢ permits the desired conclusion. 


A useful result which may be proved by simple induction based on A6 (ii) 
is stated in the following lemma. 


4.4, Lemma. If B C S, A; CS, and (A,, B) € R fori = 1,2,...,m, then 


i=l 
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We now state another useful result which follows by simple induction on 
Theorem 4.3 and Lemma 4.4. 


4.5. THEOREM. Jf M and §> satisfy A6, A, CS fori = 1,2,...,m, and 
(Ay, A,) € R for each pair of positive integers i and j such that! Ci <j <n, 


then 
Ava) = $ ja 


4.6. Lemma. If M and §o satisfy A6, ECS, BCS, and there exists a 
positive integer p and a sequence of sets Q such that 


(a) B= Ux 

(b) (Q:, 8) € R for each positive integer i; 

(c) (Qs, Qirp,) € R for each positive integer i and j, 
then 


f(8 E) = im j(0 (2. A B)). 
Proof. For each positive integer n, we let 
A, = U (Qs NE) CBN E, 


from which it follows that 


(1) lim f(A,) <f(8 A E). 


Now for each positive integer 1, 0 < i < » — 1, we consider the sets of 
the form Qi4,),7 = 1,2,.... From (c) of the hypotheses, we see that (Qi.5;, 
Qiury) € R whenever 1 < j < j’, so that Theorem 4.5 yields 


2) 2D fuss B) = Ao (Q9s z)) < f(A 4m) <f(8 1 EB). 
In case 
> f(Quses (1) E) = @ 


for some i,0 < i C p — 1, then we see from (2) that lim, f(A s49.) = ©, hence 
lim,f(A,) = f(8 (\ E) = @, whence the desired result holds. If, however, 


: (Ques NE) < @ 


for each i, 0 < i < p — 1, then for anygiven « > 0 there exists such a positive 
integer N that for each 1,0 <i < p — 1, 


> > f(Qusss (\ E) < €/p, 
j=N 
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whence it follows that 
(3) Yank) <« 
Since f measures S and 
BNE = Am iU U (ON 2), 
we have, using (1) and (3), 
HBO E) < Flaws) + © HAuNE) 
< f(Anp-1) te < lim f(Ax) +e<f(8K E) + 
From this last relation and the arbitrary nature of ¢, the desired result follows. 


4.7. THEOREM. Under the hypotheses of Lemma 4.6, B is f-measurable. 


Proof. Let E denote an arbitrary subset of S. Using again the notation of 
Lemma 4.6, we showed there that 


(1) lim f(An) = f(8 1) E), An C BQ) Eforn = 1,2,.... 


From (b) of the hypotheses of Lemma 4.6, A6 (i), the definition of A,, 
and Lemma 4.4 we have 


(2) (A,B E)ER, on 
Using (1), (2), and Theorem 4.3, we see that 
F(E) =fFBNE)U BOB) >fAsU @NEZ)) = f(As) + FE OE) 
for each positive integer m; applying (2) we have finally 
FE) >FBOAE +FEND), 
which establishes the f-measurability of 9. 


= ee 


ll 


5. Further considerations. Unrelated to questions of measurability but 
of interest in itself is the relationship between f(8) and f(8) for 8 € Fo. A 
relatively simple restriction on M, valid in many applications and yielding 
information on this point is now given. 


A7. U(8) € M(8,t) whenever 6 € %o and ¢ € T. 
5.1. THEorem. If M satisfies A7 then f(8) < f(8) whenever B E Fo. 


Proof. For each t € T we have 


inf >> f(a) < f(8); 
a 


Gow. t) 


thus f(8) < f(8). 














of 





f may not immediately seem to be in conformity with the definition in § 2 








THE CONSTRUCTION OF MEASURES 609 


Although M satisfies Al-A4 as well as A7 in the theorem just proved, 
only Al and A7 and the definition of f enter the proof. 

To reverse the inequality just proved requires relatively specific properties 
of both M and f, and there appears to be no clear-cut restriction valid in 
most applications which yields the desired result. We do obtain the following 
result, however. 


5.2. THEoreM. If M satisfies A7 then a necessary and sufficient condition 
that f(8) = f(8) for each B € §o is that for each « > O, there exists some to © T 
for which 

f(B) < > fla) +¢ 
whenever G& € M (8, to). 


Proof. If the stated condition holds, we clearly have 


(1) f(8) < inf Do fla) +e < f(s) +; 
Om, to) “Go 
since ¢ is arbitrary then f(8) < f(8). The reverse inequality follows from 
Theorem 5.1. 
If f(@) = f(8) and « > 0, then by the definition of /(@), there exists to € T 
for which, since f is finite-valued, 


f(8) =f(8)< inf = fla) +6 


Gome. to) 1 
from which follows at once the stated condition. 


6. Applications of the theory. 


6.1. The most common application of the theory just set forth consists of 
the case where S, §o, and f satisfy the conditions of § 2, and for any set AC S, 


f(A) is the infimum of all numbers of the form = ,. f(a), where G denotes a 


ac® 

finite or countably infinite subfamily of §%» which covers A. Thus defined, 
However, we may take for T any set consisting of a single element, for example, 
T = U(O0), and so define M that for each set A C S and each t € T, M(A, 2) 
= M(4A, 0) consists of all finite or countably infinite subfamilies of > which 
cover A. It is easily verified that Al-A4 are satisfied; in A4 one takes 
H(y) = 0 € T for each y € §. With this interpretation it is also easily seen 
that f as defined in § 2 agrees with the definition given more briefly above. It 
is clear that U(8) € M(Q, t) foreacht € T and each 8 € Fo, hence by Theorem 
5.1, (8) < f(8) whenever 8 € Fo. 

In case f is a measure* and §> is a ring of sets, then A3’ and A5 hold and 


*Here we use the term measure as defined in (3, p. 30) 
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Theorem 3.4 assures us that > is contained in the class of f-measurable 
sets. Furthermore, the criterion formulated in Theorem 5.2 is fulfilled, and 
consequently f agrees with f on §o. 


6.2. We now consider a second well-known method of defining a measure, 
where S is a metric space, > is an arbitrary non-empty family of subsets of 
S, and f satisfies the requirements of § 2. Here we take T to be the set of 
all positive real numbers. For each set A C S and each ¢ € T we define 
M (A, ?#) to be the collection (possibly empty) of all finite or countably 
infinite subfamilies of > which cover A, each of whose members is of diameter 
less than ¢. It is easily checked that Al—A4 are satisfied; in A4 one may 
take H(y) = t for each y € §. One may so define f, for a given ¢ € T that for 
each ACS 


Gomu, t) 


$(A)=,,inf Dd fle), 
a 


whence in accordance with the definition in § 2, 
f(A) = sup f(A). 


It is easily checked that for any given t € T, f, measures S; in fact, with ¢ 
fixed, f, is itself merely an example of the kind discussed in 6.1, with the 
added restriction on the covering families that their members be of diameter 
less than ¢. Also, in this case M satisfies condition A3’ of §3 with tf, < ¢, if, 
and only if, t2 > 4;, and accordingly for any set A C S we may write 


f(A) = sup f(A) = limf,(A), 
teT 150+ 


which is the usual form of representation for f. 
If we introduce the relation 


R = {(A, B):A CS,BCS, and dist (A, B) > 0}, 


then A6 is valid for M and >. For A6 (i) and (ii) clearly hold. If f(A) < =, 
f(B) < @, «> 0, and (A, B) € R, then one can evidently find t € T such 
that 

0 <t < dist (A, B), f(A) > f(A) — «, f.(B) > f(B) -«, 


from which ¢ is seen to be an e-approximator for both f(A) and f(B). If 
® € M (A U B, 2), we may define @’ as the subfamily of © whose members 
intersect A, @’”’ as the remainder of G, and then G@’ € M (A, 2#), G@” € M(B, 2), 
so that A6 (iii) holds. 

If 8 is any open set and 7 is any positive integer, we may define 
QO; = {x :1/(1+ 4) < dist (x, 8) < 1/i}, from which it follows easily that 
the hypotheses (a), (b), and (c) of Lemma 4.6 hold, with p = 2. Thus Theorem 
4.7 applies and all open sets are f-measurable, thus so are all Borel sets as 
well. 





Tn. ee =| — 4 
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As a special example of the above, we mention the case where > consists 
of all open sets in S, g is a fixed positive number, and 


f(8) = (diam 8)* 


whenever 8 € fo. The resulting f in this case is Hausdorff g-dimensional 
measure; all Borel subsets of S are f-measurable. Since A7 holds only for the 
null set and sets consisting of a single point, one cannot expect to make any 
general statement concerning the relative magnitudes of f(8) and f(8) when 
BE So. 

Lebesgue measure in n-dimensional Euclidean space may be regarded as a 
further example of the above. One may take >» as the class of all open intervals 
in S, and f as that function on > whose values are the hypervolumes of 
members of §o. As above, one may first define f, for each ¢ > 0 and then take 

f = lim f,. 
10+ 

Due to the nature of Euclidean space, f, does not depend upon the value of 
t; thus f = f... However f.. is clearly a measure-function of the type discussed 
in 6.1, with f..(8) < f(8) for each 8 € ¥o. Furthermore f.. is in conformity 
with the usual definition of Lebesgue outer measure. Thus f is ordinary 
Lebesgue outer measure and f(8) < f(8) for each B € Fo. Since f satisfies the 
criterion of Theorem 5.2, we have in fact /(8) = f(8) for each 8 € %o. From 
above we see that all members of > are f/-measurable, and consequently so 
are all Borel subsets of S. 


6.3. A somewhat more complicated situation occurs in the generation of 
Borel measures. Here* we have a locally compact Hausdorff space S, the 
class € of all compact subsets of S, and the class U of all open sets belonging 
to ©. The function \ is a content defined on G, and from } is derived the 
function A+ on Ul, so defined that for G € U, 


A,(G) = SUP A(C). 


It turns out that A* is subadditive on Ul From \* is derived the outer measure 
u*, where for A C S, 


CA oe 2 
re ae 


To show that u* may be interpreted as an application of our general theory, 
we take > = U, JT = U(0); for each set ACS and t=0€ T we take 
M (A, ?#) as the class of all finite or countably infinite subfamilies @ of U 
which cover A. As in 6.1 it follows that Al—A4 are satisfied. From the sub- 
additivity of A+ in Ul and the fact that U(G) € M (A,0) for each GE U 
such that A C G, it follows readily that 


*The notation and terminology of (3, pp. 231-6) are used in this section. 
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u*(A) = inf,, d,(G) inf > A,(a) 


acedl _ G.mu.o 1 


=e g int, , 22 dela), 
“2 2 


so that u* is an outer measure by virtue of Theorem 2.2. Since A5 and the 
hypotheses of Theorem 5.2 are valid we have yu*(G) = A*(G) whenever 
G € U. In this example, the measurability of the Borel sets and the fact 
that u* induces a regular measure yu follow from the particular properties of 
\ and are not taken up here. 


6.4. Other applications arise in the theory of differentiation of set functions. 
In these situations, §o is usually the spread of a blanket F, and f satisfies 
conditions considerably stronger than those of § 2. We let @ denote a function 
which measures S. For each A C S one may define f(A) as the infimum of 
numbers of the form >> gt), where & denotes a finite or countably infinite 


ae 
subset of 0 for which ¢(A — U G) = 0 (that is, G is a 0-covering of A). 
By making a rather obvious modification in the definition of M as given 
in 6.1, it is easy to check that the conditions Al—A4 are satisfied and f measures 
S. Another function f arises in some situations. Here for each fixed ¢ > 0 and 


each set A C S we define f,(A) as the infimum of numbers of the form 
; 2 gt (@), where @ is a finite or countably infinite subset of §> for which 

ae 
(A — UG) <¢ (that is, G is a tcovering of A). Then one defines 

f(A) = lim f,(A). 
150+ 

If one takes for T the set of all positive real numbers, and so defines M 
that for each set A C S and each ¢t € JT, M (A, 12) is the collection of all 
t-coverings of A, then it is not difficult to show that M satisfies Al—A4. In 
verifying A4, when ¢ € T and § are given, one chooses a positive-valued 
function H such that > gly) <t. If y € S we choose any family 

yer 


Q(y) € M (7, H(y)) and let O = Ure O(y), then o(7 — UQ(y)) < H(y) 
whenever y € 9 and > 


oUS-UX)< (VU (y¥-U Qa))) < XL oy — Or) < DAW) <t; 
veD vD 7D 
thus 2 € M (U §,?#) and A4 holds. 
In this example, condition A3’ of §3 is again valid, which makes possible 
the equation 


f(A) = lim f(A) = sup), inf > fla)l. 
150+ 


eT )Semca 1) & 
ae 


In the above, applications to differentiation theory A7 holds, so that by 
Theorem 5.1, F(8) < f(8) whenever 8 € §o. 
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As was mentioned earlier, f and F are usually subject to various technical 
requirements which make it possible to use the more specialized parts of the 
above theory to obtain information concerning the class of f-measurable sets 
and other matters. These are too lengthy for consideration here. 


REFERENCES 
1. A. P. Morse, A theory of covering and differentiation, Trans. Amer. Math. Soc., 55 (1944), 
205-35. 
2. C. A. Hayes, Differentiation with respect to o-pseudo-strong blankets and related problems, 
Proc. Amer. Math. Soc., 3, (1952), 283-96. 
3. P. R. Halmos, Measure Theory (New York, 1950). 


University of California, 
Davis, California 











PENCILS OF NULL POLARITIES 
SEYMOUR SCHUSTER 


1. Introduction. A theorem due to von Staudt states that a null polarity 
in complex projective space of three dimensions is determined by a self- 
polar skew pentagon. By allowing an element of the self-polar pentagon to 
vary in a suitable manner we can arrive at a family of ' null polarities, 
which we term a pencil of null polarities. Each polarity of the pencil dis- 
tinguishes a linear complex as the class of self-polar lines. Thus, associated 
with the pencil is a family of @' linear complexes, which we term a pencil 
of linear complexes. 

It is the purpose of this paper to continue an earlier investigation of pencils 
of polarities (2), by applying analogous techniques to the study of pencils 
of null polarities and pencils of linear complexes. 

Since it develops that the lines common to all linear complexes of a pencil 
are the lines of a linear congruence, the central question has been: How many 
of the different types of linear congruences can be achieved in this manner? 
Happily, it can be reported that the classification of pencils of null polarities 
yields all of the three types of linear congruences (4, pp. 140-141). 

We conclude with some remarks concerning such pencils in real projective 
space. 


2. Basic notions and constructions. Our basic configuration (see Figure 1) 
shall be the skew pentagon PQRST, where we designate certain planes by 
Greek letters as follows: 

TPQ = rz, PQR = n, QRS = p, RST = o, STP = +. 
The pentagon will be called complete if no four vertices are coplanar, and 
self-polar with respect to a null polarity II if I makes the following corres- 
pondence: 
P—1r,Q—1,R-p,S—-o¢, Tr. 

The correlation I which distinguishes a single line / in such a manner that 
I maps each point X (not on /) into plane X/, and each plane x (not through 
1) into point x-/, is clearly singular. We call T a special null polarity with 
directrix 1. (In this connotation some authors might object to use of the word 
“polarity”’ since [ is not 1 — 1.) The points of / and the planes through / 
do not belong to the domain of definition of I. 


Von Staupt’s THEOREM. If PQRST is a complete pentagon, the correlation 
P—1,Q—17,R-p,S—o,T—r 
Received January 6, 1958. 
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Figure 1 


is a null polarity; that is, a complete self-polar pentagon determines a null 
polarity. 


Since much of our work shall deal with the above correlation when the 
pentagon is not complete, we state the following: Jf four vertices, say P, Q, R, 
and T are coplanar (with no three collinear), then x = n, and the correspondence 


P—1r,Q—-1,9—-R,S—-¢,71->T 


is a special null polarity with directrix RT. The verification of this fact is quite 
simple. 


THEOREM 1. Let X be a point in general position. The polar of X in the null 
polarity defined by the self-polar pentagon PQRST is 


(1) x = X(XQT-n-7)(XRT-p-r). 


Proof. (See Figure 1.) The pole of every plane through QT is on 9-7. There- 
fore, the pole of XQT is the point X, = XQ7T-n-r. Similarly, the pole of XRT 
is the point X, = XRT-p-r. Thus, the three points X, X; and X; lie on x, 
and determine it according to (1). 

In the event that the null polarity is special, (1) reduces to the simple 
expression of x as the join of X with the directrix. 

The dual procedure yields the construction for the pole of an arbitrary 
plane. 
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3. Pencils of null polarities. 


Definition. A pencil of null polarities is the set of null polarities defined by 
the self-polar pentagon PQRST, where P, Q, R, and S are all fixed, while T 
varies on a line ¢ (not through any other vertices) in its fixed polar plane r. 


It follows from the definition that plane x and/or ¢ also vary in axal pencils 
about PQ and RS, respectively. 


THEOREM 2. The polar planes x ;f any fixed point X, with respect to a pencil 
of null polarities, form a pencil of planes. 


Proof. Referring to Figure 1, we call X,; = XQT-n-r and X_; = XRT-p-r. 
Then, by Theorem 1, x = X X,X>. Let T vary on #, and consider the pro- 
jectivity which maps the points of 9-7 onto the points of p-r as follows: 

P S 
X R- . 
1 x Q n°T = X» 
This projectivity possesses an invariant point, which arises when TJ is on 
plane XQR. Thus, the projectivity is a perspectivity. If we call the centre of 
perspectivity 0, then x is always on the line XO. 


We classify null polarities into three types, according to whether the line ¢ 
meets none, one or two of the fixed edges of the pentagon. Precisely, 

(i) P, Q, R, and S non-coplanar, and ¢ in general position (not meeting any 
of the fixed edges of the defining pentagon). 

(ii) P, Q, R, and S non-coplanar, but ¢ meets exactly one fixed edge of the 
pentagon. It follows from the definition of a pencil (7 being fixed) that the 
single edge which ¢ meets must be QR. 

(iii) The line ¢ meets two of the fixed edges of the defining pentagon. 

It is easy to see that any other degeneracy of the pentagon, or any other 
position of ¢ yields a pencil equivalent to one of the above types. We shall refer 
to these as the general, parabolic, and degenerate systems, respectively. (The 
justification for these names will be seen in §4.) 


The general system. T varies on the line ¢ not through any of the fixed lines 
of the pentagon. Thus, there are two distinct positions of 7, in the plane 7 
and in the plane p, which yield special null polarities. Although it may appear 
that further special null polarities arise when 7 is coplanar with P, R, and S, 
or similarly when T is coplanar with P, Q, and S, this is not the case. For, in 
these two instances it is possible to choose an alternate position anywhere on 
mx-o for the fifth point T of the self-polar pentagon; thus, showing that the 
polarity is, in fact, not special. Therefore, there are exactly two special null 
polarities in a gereral system. 


THEOREM 3. If two null polarities belong to the same general system, their 
product is a general axal homography (1, 385); conversely, every general axal 
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homography can be expressed as the product to two null polarities belonging to 
the same general system. 


Proof. The product of two such polarities leaves fixed the following elements: 
P, Q, R, S, 1, p, r, and t. Hence, r-QR is also fixed giving three invariant 
points on the line QR. It follows that QR is pointwise invariant, providing a 
point-axis for the homography. The tangential-axis is ¢, with the fixed points 
t-y and t-p. Thus, the conditions for the general axal homography are estab- 
lished. 

For the converse let the homography have QR as its point-axis and MN 
as its tangential-axis, with the collineation on the tangential axis defined by 
the projectivity MNT, x MNTy;. Then the given homography is the product 
of the two null polarities PORST, and PQRST:, where P and Q are arbitrary 
points on QRM and QRN, respectively. 


The parabolic system. This pencil is defined by having the line ¢, the locus 
of 7, meet the opposite edge QR. The position of T = ¢-QR yields a special 
null polarity, while all other positions of 7 yield non-special null polarities. 
Thus, a parabolic system possesses exactly one special null polarity. 


THEOREM 4. If two null polarities belong to the same parabolic system, their 
product is a biaxal homography (1, pp. 385-386). Conversely, every biaxal homo- 
graphy can be expressed as the product of two null polarities belonging to the 
same parabolic system. 


Proof. As in the proof of Theorem 3, the line QR is pointwise invariant 
under the product, since Q, R, and ¢-QR are three invariant points on the line. 
Further, the line PS is also pointwise invariant under the product, since P, S, 
and ¢- PS are three invariant points on that line. A biaxal homography is the 
projective transformation characterized by two such lines. 

For a proof of the converse, let the homography be determined by the 
axes PS and QR, and a pair of corresponding points A and B (see Figure 2). 
Let 7, be a point in general position, and call 


+r = PST, and t =7-QRT; 


The null polarity 1, = PQRST, belongs to a parabolic system with 7; 
varying on ¢t. We shall determine another polarity of this system to satisfy 
the requirements of the theorem. We call 


A, = AQT\:n°:r, Ao = ART;:p-r and a=AAj\A>. 


Then II, maps A — a. Since r is fixed for all members of the parabolic system, 
we may consider BA,A_ as the polar plane of B under one of the polarities in 
the system. To determine this polarity explicitly, we follow the converse 
construction calling T, = BQA,-t(= BRAz,-t). The desired polarity is then 
Il, = PQRST>. The product I,I, yields the original biaxal homography. 
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Figure 2 


The degenerate system. To fix ideas let t meet edges PQ and QR. Thus, the 
points P, Q, R and T are always coplanar forcing every null polarity of the 
pencil to be special with directrix RT. The fixed plane r is not defined in the 
degenerate system, sparing us the need of having S coplanar with the other 
four points, the consequence of which would be complete degeneracy with no 
null polarities defined. We may therefore conclude that all the polarities of a 
degenerate system are special with their directrices forming a flat pencil of lines. 


4. Pencils of linear complexes. The celf-polar lines of each null polarity 
of a pencil form a linear complex. The ©’ linear complexes which arise from 
a pencil are referred to as a pencil of linear complexes (2, pp. 92-93; 5, pp. 
332-333). Theorem 2 implies that each point X may be associated with a 
line x, called the axis of X, which is self-polar for all polarities in the pencil. 
(The axis of X is precisely the line OX mentioned in the proof of Theorem 2.) 
Further, every point of x is associated with the same axis. Thus, there are ~? 
axes, and we may state 


THEOREM 5. The set of lines self-polar for all polarities of a pencil form a 
linear congruence. 
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There are exactly three types of linear congruences in complex projective 
space (3, pp. 140-141). Our main result is that all three can be achieved as the 
axes of the three types of pencils of null polarities. More precisely, we state 


THEOREM 6. (i) The axes of a general pencil form a general linear congruence. 
(ii) The axes of a parabolic pencil form a parabolic linear congruence. 
(iii) The axes of a degenerate pencil form a degenerate linear congruence. 


Proof. (i) Each linear complex of the pencil has among its self-polar lines a 
flat pencil in 7 with vertex Q and a flat pencil in p with vertex R. Thus, the 
lines R(t-9) and Q(t-p) are the directrices of the linear congruence of axes. 
Further, ¢-7 # t-p, from which it follows that the directrices are skew. Hence, 
the axes form a general linear congruence. 

It is interesting to note that the line ¢ is a member of the congruence, so 
that we could deduce—independently of the definition—that the plane r+ 
must be fixed in a general pencil. 

(ii) The two directrices of (i) become coincident (with QR), and all the 
axes meet QR. Since the polar plane of QR-t is ORT in every polarity of the 
pencil, we know that ¢ is also an axis. Hence, we have the parabolic linear 
congruence established by the four linearly independent axes PQ, QR, RS, 
and ¢. 

(iii) Let the degenerate system be defined by having ¢ meet both PQ and 
QR. In §2, we saw that all the linear complexes of this pencil are special with 
directrix RT. Hence, all the lines of 7 belong to the linear congruence of lines 
which are common to all the linear complexes of the pencil. 

Now consider a point X in general position. Its polar plane is X RT. There- 
fore, XR is self-polar in every polarity under consideration. The desired 
degenerate congruence is then established as consisting of the bundle of lines 
with vertex R plus the set of all lines in 7. 


Before concluding we turn attention to pencils of linear complexes in 
real projective space. In this case the general linear congruence is classified 
as hyperbolic or elliptic according as its directrices are proper (real) or im- 
proper (2, p. 93; 5, pp. 315-318). We therefore consider whether both the 
hyperbolic and the elliptic congruences can be attained as the axes of different 
types of general pencils. 

An examination of the proof of Theorem 6 (i) shows that nothing is altered 
by the condition that PQRST be a real pentagon. The congruence of axes 
has as its directrices the lines R(t-) and Q(t-p), both of which are real. 
Hence, a hyperbolic congruence results. However, the elliptic congruence is 
unattainable for the very reason that lines R(t-7) and Q(t-p) are real lines 
(and are always the directrices). A further argument which also easily estab- 
lishes the unattainability of the elliptic congruence (when P, Q, R, S, and T 
are real) is the following: An elliptic congruence is generated by four linearly 
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independent skew lines, such that no one of them meets the regulus con- 
taining the other three in a proper point (5, p. 315). Yet our general system 
—indeed, all of our cases—always include three non-skew linearly independent 
lines PQ, QR, and RS, as generators. Hence it is impossible that the congruence 
of axes be elliptic. 
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ON THE SYMMETRY OF CUBIC GRAPHS 
W. T. TUTTE 


1. Introduction. Let G be a connected finite graph in which each edge 
has two distinct ends and no two distinct edges have the same pair of ends. 
We suppose further that G is cubic, that is, each vertex is incident with just 
three edges. 

An s-path in G, where s is any positive integer, isa sequence S = (0, %1,..., 
v,) of s + 1 vertices of G, not necessarily all distinct, which satisfies the follow- 
ing two conditions: 

(i) Any three consecutive terms of S are distinct. 
(ii) Any two consecutive terms of S are the two ends of some edge of G. 


If these conditions hold we call v9 the head and », the tail of S. 

An automorphism of G is a 1-1 mapping f of the set V(G) of verticew of G 
onto itself such that fv and fw are the two ends of an edge of G if and only if 
v and w are the two ends of an edge of G. The automorphisms of G constitute 
a group A(G). 

If S = (vo, v1,...,95) is any s-path of G we write fS for the s-path (fv, 
fo,,...,fv,), for each f € A(G). We say G is s-regular if for each ordered 
pair {S, 7} of s-paths of G, not necessarily distinct, there is a unique element 
f of A(G) such that T = fS. Our main object in this paper is the proof of the 
following 


THEOREM. Suppose all the oriented edges of G are equivalent under A(G). 
Then there exists a positive integer s such that G is s-regular. 


As is explained in (3) the s-regular cubic graphs can be divided into two 
classes according to the nature of the group A(G). The first of these classes 
is discussed in the main paper of (3) and the second in the Addendum. It is 
shown that s is at most 5 for the first class and at most 4 for the second. 
Examples of graphs, of the first class only, are given for the values 2, 3, 4, 
and 5 of s. Other examples of the first class are given in (1). In (2), Frucht 
describes a 1-regular graph, without determining to which of the two classes 
it belongs. In §3 of the present paper we show that all 1-regular cubic graphs 
belong to the second class. Accordingly Frucht’s graph is the first known 
member of this class. 


2. Proof of the theorem. Let S = (v%, 1, ...,,) be any s-path of G. The 
edges incident with v9 join it to just two vertices, w and w’ say, other than 2). 
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Similarly the edges incident with v, join it to just two vertices, x and x’ say, 
other than v,_;. We call the s-paths (w, vo, 71,...,9,-1) and (w’, v0, 0,... 
v,-1) the successors of S. Similarly we call (0, v2,...,9,,x) and (0, 2,... 
v,, x’) the predecessors of S. 

An s-path S’ of G is accessible from S if there is a finite sequence (S;, S2,... 
S,) of s-paths of G satisfying the following conditions. 


(i) S: = Sand S, = S’. 


(ii) For 1 < 4 < R, Si4; is either a predecessor or a successor of S, in G. 
(2.1) Any s-path of G is accessible from any other (with the same value of s). 


Proof. Let S be any s-path of G. Let W be the class of all s-paths of G 
accessible from S. Then S € W. 

Let V be the class of all vertices of G belonging to at least one member of 
W. If V(G) — V is not null then, by the connection of G we can find an edge 
E of G having one end p in V(G) — V and one end g in V. Now gq belongs to 
some U, € W. Starting with U, and taking predecessors as often as necessary 
we obtain U, € W having q as its head. But then is the head of a successor 
of U2, contrary to the definition of ». We deduce that V = V(G). 

Let S’ = (wo, wi,...,W,) be any s-path of G. By the result just proved 
there exists Z, € W such that w, is a vertex of Z;. From Z, by taking succes- 
sors we obtain Z, € W having w, as its tail. From Z, by taking predecessors 
we can obtain Z; € W with the following properties: The head of Z; is w, 
and the second term of Z; is not w,_,;. Accordingly S’ can be obtained from Z; 
by taking successors. Hence S’ € W. 


(2.2) Suppose there is a positive integer s such that all the s-paths of G are 
equivalent under A(G) but G is not s-regular. Then for each s-path S of G there 
exists f € A(G) such that fS = S and f interchanges the two successors of S. 


Proof. Let S be any s-path of G, with successors T and 7”. Since G is not 
s-regular there are s-paths Z,; and Z, and distinct elements x and y of A(G) 
such that Z; = xZ, and Z,; = yZ>z. There exists z € A(G) such that Z,; = 
2S. Write g = z-'xy~'z. Then gS = S but g is not the identical automorphism 
of G. 

We now show that there is an s-path S,; of G such that S,, but no successor 
of S;, is invariant under g. For suppose not. Then if Z is any s-path invariant 
under g one of the successors of Z is also invariant under g. Hence both succes- 
sors of Z are invariant under g. Let Z—' be the s-path obtained from Z by 
reversing the order of the vertices. Then Z~' and its successors are invariant 
under g. Hence the predecessors of Z are invariant under g, since they are 
obtained from the successors of Z—' by reversing the order of vertices. Hence 
each s-path of G is invariant under g, by (2.1). This is impossible since g is not 
the identical automorphism of G. 
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Let the successors of S; be 7; and 7;’. Let h be one of the elements of A(G) 
satisfying hS = S,. Then 4 maps the successors of S onto the successors of 5. 
We may adjust the notation so that AT = 7, and kT” = T,’. Since no successor 
of S; is invariant under g we have g7, = 7;' and g7,' = 7;. Write f = h-'gh. 
Then fS = S, fT = T’ and fT’ = T. This completes the proof of (2.2). 

We complete the proof of the main theorem as follows. We are given that 
all the oriented edges of G are equivalent under A(G). Hence there is a greatest 
positive integer s such that all the s-paths of G are equivalent under A(G). 
Assume G is not s-regular. 

Consider any (s + 1)-path X = (vo, 0,...,%,-1) of G. Let its successors 
be Y = (y,00,%,...,0,) and Y’ = (y’, v0, %,...,9,). Let & be one of the 
elements of A(G) satisfying k(01, v2,...,0,-1) = (vo, 01,...,0,). Then kv 
is either y or y’, and we may adjust the notation so that kv = y. This implies 
Y = kX. By (2.2) there exists f € A(G) such that f(v0, 0:,...,0,) = (0, 0%, 

., 05), fy = y’ and fy’ = y. This implies Y’ = fY = fkX. Hence X can be 
transformed into either of its successors by an automorphism of G. Similarly 
each predecessor of X can be transformed into X, and therefore X can be 
transformed into either of its predecessors. 

Combining these results with (2.1) we see that all the (s + 1)-paths of G 
are equivalent under A(G). But this contradicts the definition of s. The 
theorem follows. 

We should perhaps remark here that in an s-regular cubic graph G the ver- 
tices of any s-path are all distinct. For any circuit of G has at least 3 vertices 
by our definitions and at least 2s — 2 by (3, Theorem III). 


3. l-regular cubic graphs. Let G be any s-regular graph. Let S) be a 
fixed s-path of G. Any element x of A (G) can be associated with a permutation 
x’ of the set S of all s-paths of G defined as follows: x’(4S9) = hxSo for all 
h € A(G). If x,» € A(G) the permutations x’, y’ satisfy the law (xy)’ = y'x’. 
Moreover, the correspondence x ++ x’ is 1 — 1. For x’(S 9) = xSo for each 
x € A(G), and there is only one element of A(G) mapping S» onto xSp». It 
follows that the permutations x’ of S are the elements of a permutation group 
H of the same order as A(G). 

In (3) the symmetry of G is discussed in terms of H. There is a unique pair 
(r, 1) of elements of H mapping S» into its two successors. It follows that 
rS and lS are the two successors of S for each S € S. Another important 
element of H, denoted by &, reverses the order of the vertices in each s-path 
S. Thus & = 1. 

It is clear that &r&S and £/éS are the two predecessors r~'S and /[-' S of S, 
for each S € S. There are now two possibilities. In the first alternative 
trt = r-' and &lé = [-'; in the second éré = [-' and é/t = r~'. We say that G 
is of the first or second class according as the first or second alternative holds. 
These are the two classes mentioned in the Introduction. 


(3.1) Every 1-regular cubic graph is of the second class. 








624 W. T. TUTTE 


Proof. Let G be any 1-regular cubic graph. Let a be any vertex of G. Let its 
incident edges join it to the three vertices 5, c, and d. We may suppose r(a, db) 
= (c,a) and l(a, 6) = (d,a). Then [-'sl-'(d, a) = ['r(a, 6) = F'(c, a). But 
-(c, a) is distinct from r—'(c,a) = (a, 6), and is therefore (a, d) = £(d, a). 
Hence & = ['#l-'. Similarly r~Ur(c, a) = E(c, a) and therefore § = r—“/r—. 
Hence tré = r—'Ir—'rl-'rIl-' = I". Accordingly G is of the second class. 
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QUADRICS OVER GF(2) AND THEIR RELEVANCE 
FOR THE CUBIC SURFACE GROUP 


W. L. EDGE 


1. Introduction. About half the following pages are concerned with the 
“cubic surface group” G, of order 51840, and the geometry cognate to a 
certain representation thereof. The literature of this group, with its subgroup 
of order 25920, is already voluminous; the addition of these few pages to it 
will not, it is hoped, be regarded askance as a perverse and misdirected 
indulgence in archaism. On p. 104 of (2) one reads 


It is thus suggested that we should interpret the symbols of lines as being symbols of points 
in space of 5 dimensions, every such point being dependent upon 6 points; the points so arising 
from the lines of a tritangent plane of the cubic surface will then be upon a line. We are thus 
led to a configuration of 27 points, in this space, lying in threes upon 45 lines, there being 5 of 
these lines passing through each of these 27 points. 


Precisely such a figure is to hand; one has merely to base the geometry on 
the finite field F, of two marks 0, 1 with 1 + 1 = 0. There are, in a projective 
space [5] of 5 dimensions over F, two projectively distinct types of non-singular 
quadric; one type has planes on it or, as one may say, is ruled, and the other 
not. It is this latter quadric that underlies the figure so forcibly suggested to 
Baker, and the geometry associated with it is summarized in Table II below. 
This geometry cannot be appreciated properly unless the geometry in the 
subordinate spaces of lower dimension is known in some detail; the earlier 
pages aim therefore to assemble this essential information. 

To lay these foundations, and so adequately to describe the geometry that 
underlies this aspect of the cubic surface group, is not, however, the sole 
purpose of these pages. Once the geometry in [5] is known it may help to 
open the way towards the geometry, over F, in spaces of higher dimension, 
of quadrics and their groups of automorphisms. There is a quadric in [7] 
consisting of 135 points and having on it, in accordance with Study’s prin- 
ciple of triality (8, p. 477), two systems each of 135 solids; its group of auto- 
morphisms, of order 24°355?7, is isomorphic to a group already investigated 
in a different setting (7); in a [7], yes, but with the complex field, not F, as 
base field. Just as Table II below is intimately related to Table I in (7) so 
there is a table allied to Miss Hamill’s elaborate Table I]. The geometry 
of quadrics over F is the avenue towards its compilation and any light that 
it may shed on the structure of the larger group. The table in question was 
in fact compiled a year or two ago. 


Received October 27, 1958. 
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All the geometry in this paper is based on F so that, throughout, a line 
consists of 3 points, a plane of 7 points lying 3 by 3 on 7 lines, and so on. 


2. A symmetric senary quadratic and its automorphisms. A pro- 
jective space [”], of m dimensions, over F consists of 2**' — 1 points. Consider, 
among the 63 points of [5], those whose co-ordinates satisfy 


5 
2.1 > xa, = 0, 
i, j= 0 
the summation being over all pairs i,j with i <j, and so embracing 15 
products. Such points constitute a quadric 4% Each product x,x, is 0 or 1, 
and a point is on or off # according as an even or an odd number of the 15 
products is 1. Now if x, = 1 then x, = x, = 1; hence when a point has 


eee a = 

of its 6 co-ordinates non-zero the number of non-zero products is, accordingly, 
0, 1, 3, 6, 10, 15. 

Thus / consists of 


(;) +() +(5)-e+ 5 +0=m 


points. Denote any such point by m, and any of the 63 — 27 = 36 points 
that are off Z by p. This notation will be used throughout to distinguish 
between points on and points off a quadric. 

The partitioning, as 6 + 15 + 6, of the 27 m is adventitious in that it de- 
pends on the system of co-ordinates; but it throws one cardinal feature into 
strong relief. The first batch of 6 consists of the vertices X, of the simplex 
of reference 2»; the last batch of 6 consists of the vertices X ,’ of another simplex 
Zo’; Zo and Zo’ are in perspective from the unit point U (2, p. 105) and are 
in the Mdbius relation, each being both inscribed and circumscribed to the 
other. Furthermore, 2» and 2,’ are both circumscribed to as well as inscribed 
in Y& the tangent prime of # at any vertex of either being a bounding prime 
of the other. All this is manifest from the equations 


x, =0, Xe = Xo tei txt xs+x%1+ xs =e, 


say, of the bounding primes of 2» and Zo’, and from the co-ordinates of their 
vertices; all co-ordinates save one of X, are 0, all save one of X,’ are 1. That 
Z» and %»’ are related to # in precisely the same way follows because 


do @+x)(e+x) = ¥ xe; 
i<j i<j 


over F; the extra terms on the left amount to (15 + 5)e? and so, the coefficient 
being even, disappear. 
U is but one of the 36 p. Each is a centre of perspective for two simplexes, 
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inscribed and circumscribed to “and in the Mdbius relation. The corres- 
pondence with the 36 double-sixes on a cubic surface is unmistakeable. 


3. Subject, now, the variables x, to the linear substitution 
x = Mé 


where x and é are column vectors, each having six components and M = (a,,) 
is a 6 X 6 matrix in which each a,, is an element of the field F. The coefficient 
of &? on the left of 2.1 is Z,<,aua,. If the substitution leaves 7 invariant 
each square £,? must be absent; hence, foreach k, 2i<)@ua, = 0; in other 
words, each column of M is the co-ordinate vector of a point on 4% The 
column thus includes 1, 4, or 5 non-zero co-ordinates. But it is not enough, 
to ensure invariance, that each square £,’ be absent; every product ££, has 
to be present. The coefficient of, say, f:f2 is Dus 44 Gyo, a sum of 30 products, 
whose vanishing is the condition for the conjugacy of those 2 points of / whose 
co-ordinate vectors are the first 2 columns of M. Thus the columns of M 
have not only, each of them, to represent points m; it is also necessary that 
no two of these 6 m be conjugate, that is, that each of their 15 joins has to 
be a chord c, never a generator g, of Z% These conditions, that the columns 
of M are co-ordinate vectors of points on Y no two of which are conjugate, 
are, when supplemented by the proviso that the columns be not linearly 
dependent, sufficient as well as necessary for the quadratic form to be in- 
variant under the linear substitution. 

It might, in building a matrix M to satisfy these restrictions, be helpful 
to observe that they imply that each column is conjugate to the sum of any 
two of the others: the conditions 


> atm =1= » A410 jn 


ij ij 


imply 
Dd auilaim + On) =14+1=0. 
ij 


The equation of “ may take other forms than 2.1; see, for one example, 
§ 17 below. Each such form affords, by its group of automorphisms, a repre- 
sentation of G; the matrices that constitute this representation are subject 
to appropriate restrictions. Some of these forms may well lead to simpler 
calculations for the order of G, but the symmetric form 2.1 holds the advantage 
that the restrictions on M involve all its columns symmetrically. 


4. These discussions apply not merely to senary forms but also to forms 
in other numbers of variables; they afford an opportunity not only of exhibiting, 
in this context, certain known facts about the cubic surface group but also of 
investigating other groups. The indispensable prerequisite is a thorough 
knowledge of the underlying geometry, and this has to be grounded on the 
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geometry in spaces of lower dimensions. It is advisable to preface the account 
of this by a section which takes note of the idiosyncrasies due to the base 
field having characteristic 2, and such a section now follows. 


5. The null polarity. A quadratic form 
f(x) = f (xo, x1, . ~~, Xn) 


is singular when it is expressible as a quadratic form in fewer variables x,’, 
linear forms in the x; The quadric f = 0 is then a cone whose vertex V is the 
[” — r] over which all x,’ simultaneously vanish; r is the rank of the matrix 
of coefficients in the x,’, and every point of V satisfies f = 0. The polar form 
of any point of V vanishes identically; every point of V is conjugate to every 
point of [z]. The converse is also true except over fields of characteristic 2; over 
such fields a point can be conjugate to every point of [] without f being 
singular. If, for example, n = 2, 


f S XyX2 + X2xy + Xor 


is non-singular; yet the point x; = x2 = x; is conjugate to every point of the 
plane. This phenomenon can always occur for even n, as is now to be explained. 
Recall, first, the genesis of the polar form: it is the coefficient of Aw in 


fx + wy) = f(Axo + wyo, Axi + wy, ~~~, Kn + Un), 


and is symmetric in x and y. Since, over F, \u has zero coefficient in (Ax, + wy,)’, 
squares of the variables are ignorable when polarizing; all 2"*' forms 
2 2 2 
oro + ayX, +... + agX, + _ Dip Xy, 
i<j 

with different a, but fixed 5;,, have the same polar 2d,,(xyw, + x y;). When- 
ever y = x every term of this sum vanishes; the polar prime of P always 
passes through P whether P be on f = 0 or no; the polarity set up by a 
quadric is, over F, a null polarity. 


6. A point common to the polar primes of the + 1 vertices of the simplex 
of reference is common to those of all the points of [7], and conversely; such 
points constitute an [m — g] where g is the rank of 


— — 
° box boe eee Bon 
boi ° bie eee Din 
B= 
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there being no common point if B has its full rank m + 1. If m is odd the 
rank may be full; of this two instances, both convenient to have at hand, may 
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be noted. First: let b,, = 0 except that 6,44; = by41., = 1 for each even i. 
This matrix answers to the canonical form 


Xo. + Xot3 +... + Xy-1%, = 0 


of a ruled quadric (that is, of a quadric having on it linear spaces of the maxi- 
mum possible dimension $(m — 1)), and the non-vanishing of |B! is obvious. 
Second: let 6;; = 1 except for i = 7. Here too |B| = 1 for odd n, and B 
answers to the symmetric form 2 j<,;x ,;, with n(n + 1) terms in the sum. 


7. That |B| always vanishes when n is even follows because any non-zero 
term in the determinantal expansion is partnered with another; the expansion 
of, for example 

i ea 

Se FJ 

gf 
consists of two equal terms fgh. A set of » + 1 units in B whose product 
supplies a term of |B| cannot, m + 1 being odd, be placed symmetrically 
about the leading diagonal; thus any non-vanishing terms of |B) are paired, 
those of a pair being products of sets of m + 1 units with either set the image 
of the other in the diagonal. The two products, being equal, have, over F 
sum zero. Alternatively: that |B| = 0 for m even is a consequence of B being 
skew: over F a matrix that is symmetric is skew also when its diagonal 
consists of zeros. 

Although the rank g of B is, when is even, less than m + 1, it need not 
be less than n, for a principal minor can have rank n. Thus, for even n, there 
may be a single point conjugate to every point of [mz]. This point need not 
satisfy f = 0. For take the symmetric form 


f= 9 X X35 


i<j 


whose B, when m is even, has g = n. The point conjugate to every point of 
[x] is the unit point 


%o=@zxX=...=2, = i, 


at which f = $n(n + 1); this is zero or not according as $m is even or odd.* 
And if 3m is even we only (in order to obtain a non-singular quadric with 
a point conjugate to every point of [m] and yet not lying on the quadric) need 
to take for B the direct sum of [;!] and the matrix of the above symmetric 
form in m — 2 variables. If, for example, » = 4 this leads to the form 


y2 + ox + xy + UD; 


*When $n is even the symmetric form is singular, being the same as) ecics(Xe + Xi)(xo+x;) 
wherein xo? occurs an even number, $n(m — 1), of times and xox; an odd number,n — l, 
of times. 
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the single point conjugate to every point is x = y= z= 1, u = » = 0; it does 
not lie on the quadric. 


8. Whenever a point x on f = 0 is conjugate to every point of [mn] its join 
to any other point y on f = 0 lies entirely on the quadric because 


fQx + wy) = Nf(x) +0 4+ wf(y) =0+04+0 =0; 


this implies that x belongs to the vertex. But its join to any point z not on 
f = 0 does not meet the quadric save at x because 


f(dx + wz) = A*f(x) + 0 + wf(z) = wf (z). 


Note too that if P, P’ are any pair of conjugate points then the line PP’ 
has to meet f = 0, if at neither P nor P’ than at P + P’, because 


f(x + x’) = f(x) +0 + f(x’) = f(x) + f(x’). 


Hence if g < m — 1, so that there are at least 2 points conjugate to every 
point of [], f is necessarily singular. 


9. Suppose then, the base field being F, that m is even and that a non- 
singular quadric Q is given. There is a unique point k conjugate to every 
point of [”]; it does not lie on Q and may be called its kernel. Any point m 
on Q, being conjugate both to itself and to k, is conjugate also to the third 
point p on km; p is off Q because the join of two conjugate points on Q is 
entirely on Q, whereas & is not. Conversely: the third point of the join of k 
to any point p off Q is on Q because the join of two conjugate points has to 
meet Q. Thus, of the 2**' — 2 points of [m] other than k half are m, on Q, and 
half are p, off Q; Q consists of 2" — 1 points one on each line ¢ through &. 
The m and p on any ft) have the same polar prime—the tangent prime of Q 
at m. There are in [m] 2" primes other than these tangent primes, namely 
those primes not containing k; these, because of the degeneracy of the null 
polarity, are not polars of points. 


16. Quadrics in [2] and [3]. One now sets out to describe the quadric 
in the spaces of low dimension over F. The description must cover singular, 
as well as non-singular, quadrics because singular quadrics occur among the 
sections of non-singular ones in higher space. 

There are 7 binary quadratics over F; of these x’, x? + y?, y? are perfect 
squares, x? + xy, xy, xy + y® are reducible and x? + xy + y® is irreducible. 
The perfect squares are singular; equated to zero they give each a repeated 
point on the 3-point line L whereon x, y are homogeneous co-ordinates. The 
other forms are non-singular; the reducible ones give the pairs of points of 
L, but x* + xy + y? has no zero. These three types of binary quadratic corre- 
spond to the three ways in which a line that is not a generator can be related 
to a quadric Q; there are tangents ¢, with a single point on Q, chords c, with 
two points on Q, and lines s that are skew to Q. 
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A singular quadric in |[m] has a vertex [m — r] and projects therefrom a 
non-singular quadric in [r — 1]. Hence, taking m = 2, singular conics are of 
three types; repeated lines (r = 1) and either line-pairs or single points (r = 2). 
A non-singular conic consists of 3 points A, B, C, one on each line through its 
kernel k; of the 7 lines in its plane, 3, namely BC, CA, AB, are c, 3, namely 
kA, kB, kC, are t, while the remaining line s is skew to the conic. And a plane 
section of a quadric Q in higher space may be by a plane 


e, meeting Q in a single line, 

f, meeting Q in two lines, 

h, meeting Q in a single point, 

j, meeting Q in 3 non-collinear points. 


It may also happen, if m > 5, that a plane d lies wholly on Q. But no plane 
can ever be skew to Q. Since a singular quadric in [3] is the projection of a 
conic from a point, one can now tabulate the following sections of Q; a section 
may be by a solid 


7, meeting Q in a single plane, 

¢, meeting Q in two planes, 

xX, Meeting Q in a single line, 

¥, meeting Q in 3 concurrent non-coplanar lines. 


It may also happen, if » > 7, that a solid w lies wholly on Q. There are also 
solids x meeting Q in ruled quadrics, and solids A, meeting Q in non-ruled 
quadrics; these sections are not singular, and will be described in a moment. 
The letters here used to signify points, lines, planes, solids in their various 
relations to Q will be retained throughout. The nature of the subspaces in a 
given space is recognizable at a glance. In, say x, each of the 3 planes through 
the g is an e while the other 12 planes are all h; the 16 lines skew to g are 
all s while the 18 which, 6 at each of its 3 points, meet g are all ¢. The points 
in an e other than the 3 m on g are a quadrangle of p; these quadrangles, 
like a triad of desmic tetrahedra, are such that any two of them are in per- 
spective from every vertex of the third. Geometry in solids other than x is 
clear too; there will be occasion to speak of y in § 18 below. 


11. A non-singular ruled quadric, or hyperboloid, #in [3] has the canonical 
equation xy + 2t = 0; there are on AH two complementary reguli with 3 g 
in each. Thus # consists of 9 m; at each m the tangent plane f is spanned 
by lines one in each regulus; the third line ¢ in f through m does not meet 
H again and is, like each g, its own polar. The 6 points of [3] off #lie (4, 
p. 323) 3 on each of two lines s, s’; these are polar lines with respect to # 
They may be called the Dandelin lines of #% for Dandelin’s classical con- 
struction sets out from 3 lines in each of two complementary reguli and is 
viable over any field. When the field is F the construction utilizes every 
point of the space. 
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12. That the quadric p, whose equation is 
(12.1 xy + xz + xt + y2 + yt + 2t = 0, 


is non-singular was implied in § 6. The vertices of the tetrahedron of reference 
T all lie on p, and each face of JT meets p in the three vertices of 7 therein 
and nowhere else; should any co-ordinate be zero no point other than the 
vertices of 7 satisfies 12.1. But there is one point, namely the unit point, 
none of whose co-ordinates is zero, and this does satisfy 12.1. p is, indeed, 
simply a pentagon: a set of 5 points no 4 of which are coplanar. There would 
perhaps be no call to treat a pentagon as a quadric if spaces of higher dimen- 
sions were not to be encountered; but these pentagons are solid sections of 
quadrics in higher space and their properties call for interpretation in this 
context. The pentagon has, for instance, tangent planes; these are 


yt+2+t=0, z+t+x=0, t+x+y=0, x+y+2=0 


at the vertices of 7, and x + y + 2 + ¢ = Oat the unit point. They are planes 
h; whereas the 10 faces of the pentagon are all 7. Each 7 has for its pole the 
kernel of the conic that is its intersection with p; this kernel is the point 
common to j and the join c of the two points of p outside j. The 35 lines of 
[3] are distributed as 10 c (edges of p), 10 s (1 in each j) and 15 ¢ (3 through 
each vertex of p and lying in the tangent plane / there). Each ¢ is its own 
polar, but ¢ and s are polars of each other. 


13. The non-singular quadric in [4]. A non-singular quadric Q in [4] 
consists of 15 m, one on each of the 15 lines fp) through its kernel k. The tangent 
solid at m contains k, and through k there pass, in this solid, 6 lines other 
than km. But each such line contains a point of Q which, being conjugate to 
m, is joined to m by a g; thus each of the three planes which lie in the solid 
and pass through km contains a g through m, and these g form the section 
of Q by the solid, a solid which may therefore be designated by y. Q is thus 
the exemplar, over F, of the figure (1, frontispiece) of 15 points lying 3 by 3 
on 15 lines in [4]. The 3 g through an m span y, and one idiosyncrasy of the 
figure consequent upon the base field being F is that the 15 y are concurrent 
(at R). 

Let m, be a given m. There are 15 — 1 — 6 = 8 m not conjugate to m, and 
whose joins to m, are therefore c; hence Q has 3(15 X 8) = 60c. The plane 
ck contains not only m,, m2, say on c but also m; on the join of k to that point 
p on ¢ other than m,, mz themselves. Hence there are 20 planes jo through 
k, each meeting Q in 3 m that form a triangle whose sides are all c; such a 
plane includes a line s skew to Q. As there are in all 35 planes through k they 
are accounted for by the 20 jp and by the 15 e which join k to the 15 g. 

The polar of a line uw not passing through k is a plane through & and lies 
in the polar solids of all points in the plane wk. If u is g the polar is gk. But 
if uw is s the polar will be jo’, skew to s because no two points on s are conju- 
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gate. Thus the 20 s are paired; the joins jo, jo’ of a pair s, s’ to k are polars 
of s’, s respectively. Every point on s is conjugate to every point on s’; the 9 
transversals of s, s’ are all ¢ and the solid x spanned by s and s’ meets Q in 
a hyperboloid with s, s’ for its Dandelin lines. These 10 solids « = [s s’], con- 
taining 6 g that fall into complementary reguli of 3 lines each, are the exemplars 
of what Baker (1, p. 115) calls the singular solids. When lines are skew in 
Baker's frontispiece the corresponding lines on Q are skew; hence the 15 g 
form 6 quintuples of mutually skew lines, each g belonging to 2 quintuples and 
being determined as their common line. Each « answers to a partition of the 
6 quintuples into complementary triads and it is, as in the classical case, a 
consequence of this that the plane common to two solids «;, «2 is a plane f of 
a pair of intersecting g. The third solid through f is, of course, the ¥ which 
joins it to k. 

Of the solids in [4] 16 do not contain k; those 6 of these that are not « are 
\; each contains 5 m and the triads of g concurrent thereat together account 
for all 15 g. In the analogous classical figure the corresponding pentagons 
certainly occur: each of the 15 points corresponds, via the 3 lines concurrent 
there, to a syntheme (that is, a partitioning into three pairs)of the 6 quintuples; 
when 5 synthemes constitute a synthematic total (that is, they together account, 
by the 3 pairs in each, for all 15 pairs of quintuples) the 5 corresponding 
points form one of the pentagons in question. Since there are 6 synthematic 
totals there are 6 pentagons; but the vertices of a pentagon need not lie in 
a solid. That they do so here is another consequence of the base field having 
characteristic 2. 


14. An incidence table for Q is now given; a zero suffix indicates that a 
space passes through k, whereas its counterpart without the suffix does not. 
The upper half, above the diagonal, shows the number of subspaces, indicated 
by a letter on the extreme right, that lie in a given space indicated by a 
letter at the top. The bottom half, below the diagonal, shows the number 
of spaces, again indicated by a letter on the right, that contain a given sub- 
space. Either half is deducible from the other: for example, since there are 18 
of the 60 c in each of the 10 « the number of « through a given c is 18 K 10/60 
= 3. 


15. The whole figure can be described succinctly by using a supernumerary 
set of 6 homogeneous co-ordinates whose sum is zero. Then the 15 points 
having two zero co-ordinates constitute Q, and collinearities occur for points 
such as 

(0,0, 1, 1,1, 1); (1, 1,0, 0, 1, 1); (1,1, 1, 1,0, 0). 


Thus in this representation it is the g, not the m, that correspond to synthemes; 
the m, not the g, that correspond to pairs. Q is the section of the non-singular 
quadric 2y<,x x, = 0 in [5] by the [4] 


Xo + Xi +X. + Xs +X + x5 = Oz 
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TABLE I 
1 15 15 15 15 45 60 20 15 15 20 60 45 15 10 6 
k p> ™ S @ 8 8 © .. €: a me 2 v « r 
1 1 : 1 k 
1 2 1 3 a ae oe 7 6 10 p 
3061 . 2 1 i 2 as 7 9 5 m 
ONS 1 2 3 6 g 
15 1 1 3 3 5 7 to 
66 63 3 3 3 l 9 9 15 2 
4 8 = ie 12 18 10 ¢ 
4 4 1 1 : 4 2 10 s 
6 1 . 1 : 3 1 5 A 
16 3 3 a; oS? g ; 3 e 
20 4 4 4 = 4 js 
16 12 4 3 3 4 6 10 j 
6 15 6 1 3 3.69 f 
15 7 7 3 7 3 3 3 1 3 3 1 1 v 
4 6 4 2 3 1 1 2 K 
4 2 2 1 3 2 1 
Examples of equations for subspaces of this [4] are as follows. 
A:x%, = 0. 
K2X, + Xe + x3 = O = x4 + X5 + Xo. 
YW :x%,+ x. = 0. 
h Xi = Xo = 0. 
I 32, +X. = X3 = X%. 
jo 2X1 = Xo = X3. 
j +X + X22 = X%3 = 0. 
€:X, +X. = X3 + Xy = X5 + Xo. 
SiXy = Xe = X3 = 
C:X%, = 0, X2 = X3 = Xe. 
tix) = Xe = X3 + X= X5 + Xo.z 
lo : Xe = X3 = Xq = Xp. 
The 6 homogeneous co-ordinates related by =x; = 0 were used, though not 


over a finite field, by Castelnuovo (3, p. 885) in his study of linear complexes 
in [4]. Such a complex sets up, as does Q, a null polarity which, as with Q, is 
degenerate (which it cannot but be in a space whose dimension is even); the 
kernel of Q is analogous to what Castelnuovo calls the centre of the complex. 
Castelnuovo’s statement (3, p. 862) that it is from its centre, and its centre 
only, that the lines of a linear complex in [4] are projected into those of one 
in [3], is valid in the finite figure. For, since the lines of a screw through a 
point of [3] are coplanar, any eligible centre of projection has to lie in the 
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solid that contains the lines of the complex at any point of [4]; but the g at 
a point of Q span y, and the only point common to all y is &. Thus projection 
from k onto a [3] which must, as not passing through k, be « or X, gives 15 
points lying 3 by 3 on 15 lines, the 3 lines through any point being coplanar. 
These lines constitute a screw (4, p. 325). If the projection is onto « it is at 
once seen that the 15 lines consist of triads forming complementary reguli 
and of the tangents to the quadric, one at each of its 9 points; that these 
constitute a screw is known. (The map of complementary reguli on the Klein 
quadric in [5] is by conics in a pair of polar planes: when the base field is 
F these planes intersect at the kernel of both conics, and so lie in a [4].) This 
accords with the known isomorphism (9, p. 17), when the base field has 
characteristic 2, of the orthogonal group on 5 variables with the symplectic 
group on 4. 


16. The non-ruled quadric in [5]. It is now, with the facts assembled 
in the preceding sections, a straightforward matter to give a conspectus in 
[5] of the geometry of FH the quadric introduced in § 2. Y consists of 27 m; 
the tangent prime M at m meets 7 in a cone that projects a p from a point 
outside its [3]; instances are the bounding primes of 2» and Zo’ (cf. the footnote 
to § 7). The polar prime P of any of the 36 p off Z meets 7 in a non-singular 
quadric having p for kernel; the 15 lines through p in P are all t, and are 
the only ¢ through 9; of the 16 lines through p outside P 6 are (§ 2) c; the 
remaining 10 must be s. Since the numbers of p on t¢, c, s are 2, 1, 3 the total 
number in [5] 


of t is 36 X 15/2 = 270, 


of c is 36 X 6 216, 
of s is 36 X 10/3 = 120. 


Any other line is a g, and as there are, through any m, 5 g projecting the 
vertices of a p the total number of g on 7% is 27 K 5/3 = 45. 

Each line has a polar solid. The polar of c is \: for example, each bounding 
[3] of either 2» or Zo’ is the polar of an edge of the other. The polar of s is «: 
for example, x4 = X5 = Xo is the polar of x, = x2 = x3 = x4 + %5 + xo = O. 
As for the polars of t and g, let my be any point of -% All ¢ and all g through 
my lie in the tangent prime Mo; hence their polar solids all lie in Mo and contain 
my. They thus project, from mo, the polar planes j, h with respect to p of 
points p, m in the [3] wherein p lies; thus ¢ has a polar solid y and g a polar 
solid x. 

It only remains, before compiling Table II, to discuss the planes in [5] and 
their relations to 4% A plane through g in the polar x of g meets f in g only, 
and so is an ¢; ¢ is its own polar, every point therein being conjugate to every 
other. There are 135 e, 3 through each g. Also there are 270 f, spanned by 
pairs of g. Since f is spanned by a pair of g its polar is the intersection of a 
pair of x and so is an h—the only type of plane other than e¢ to lie in x. The 
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10 f and 10 A through m project the 10 ¢ and 10 s of a p. Finally there are 
j, for instance the plane faces of Zo. Since j contains an s its polar lies in a 
« and so is also a j—the only type of plane other than f to lie in x. Now while 
each « contains 6 j the « through a given j is, with the s in the polar j, unique. 
Hence there are 720 j, consisting of 360 polar pairs. The kernel of the section 
of # by j, being conjugate to every point of j, lies in the polar plane; thus 
the j of a polar pair have in common the kernel of both their sections of 7% 
There are 10 of the 360 polar pairs through each of the 36 p, and they are 
(§ 13) the pairs of polar jp in P. 


17. Table II can now be compiled. In order that it be endowed with central 
symmetry one refers the lower half, registering the numbers of spaces through 
any given subspace, to the column of symbols on the left and the row of 
symbols below; the upper half, registering the numbers of subspaces in any 
given space, is indexed, as was Table I, by means of the row above and 
the column on the right. This enables one to arrange the symbols on the 
left and right so that those in either column indicate the polar spaces of those 
in the other read in exactly the reverse order; the same applies to the rows 
at the top and bottom. 

A notable feature of the geometry is the distribution of the 120 s in trios 
s, s’, s such that the polar « of each member of the trio is the solid spanned 
by the other two. If the vertices of the simplex of reference were chosen 2 on 
each of s, s’, s’” the equation of & would be, since the quadratic giving the 
intersections with any of s, s’, s”’ is irreducible, 


ye + Voy + Vi + ¥2 + Yes + Vs + VET Yes + HS = O. 


G is thus representable as the group of automorphisms of this quadratic 
form, and its order is instantly calculable now it is known that there are 
40 trios. For there are on each member of a trio 3 choices for the pair of 
reference points which, having been chosen, may yet be transposed; also the 
members of the trio may undergo permutation. Hence G has order 


40. (3.2)°3! = 51840. 


18. Table II is intimately related to Table I of (7); it is natural that this 
should happen because the two tables depict closely allied representations 
of the same group—the cubic surface group of order 51840. Both representa- 
tions are in projective space of 5 dimensions, but here the base field F is finite 
whereas in (7) it is the field of complex numbers. The main feature of contrast, 
among a multitude of similarities, is, however, that no analogues of the 27 m 
appear in (7), where the whole figure is based on 36 points po; this forestalls 
any possibility of there appearing in (7) the analogues of any spaces whose / 
are inadequate to span them; that is, of g, whereon no p occurs, of c, whereon 
there is but a single p, of f, whereon the p are collinear. 
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Miss Hamill’s nomenclature for the subspaces of the figure in (7) is easily 
translated into that used here, but she builds solids and primes of more varied 
kinds and in larger numbers, and one must explain why this is. The following 
short dictionary 


Po é) 71 71 X Po Ce m1 X 1 
p t s j h K 


shows how spaces whose symbols are below are denoted, in (7), by the symbols 
above: ¢; is defined as a line spanned by two conjugate po whereas ¢ is spanned 
by two conjugate p, while y; X y: is the translation of the fact that «x is 
spanned by its Dandelin lines. But consider, for a moment, a solid y. It 
contains g:, g2, gs, concurrent at mo, and 8 j, namely those planes in y which 
do not contain mo, consisting of 4 pairs, a pair together containing all points 
On g1, g2, gs Other than mp. The intersection of such a pair of j is an s, and 
the 4 s are joined to my by the same h—the only plane in ¥ through mp that 
does not contain any of gi, g2, gs. Now the kernel of the conic in any of the 8 j 
is on ¢, the line in ¥ through mp» that is not in any of the planes gogs, gag, 
£:g2; thus, mp itself being conjugate to each point in y, each point of ¢ is 
conjugate to each point of 4. Hence y may be spanned by ¢ and any of the 
s in h, and it is because there are 4 choices for this s that there occur 1080 
solids y: X ¢:, in (7)—4 times the number 270 of ¥. Moreover y can be 
spanned by A and either of the p on ¢, and it is because there are 2 choices 
for this p that there occur 540 solids cz X o in (7). Likewise for primes. There 
are in P 6 X, which explains why there are 6 times as many primes 3 X po in 
(7) as there are P; and there are 20 s falling into 10 pairs, each pair spanning 
a «x as its Dandelin lines, which explains why there are 10 times as many 
primes y: X vy: X po as there are P. 


19. The cubic surface group. The order of the group of senary linear 
transformations that leave # invariant can be calculated by appealing to 
the properties (§ 3) that the columns of the corresponding matrices M must 
have. One has to choose a hexad of m whose 15 joins are all c. For m, there 
are 27 choices; then 16, the number of c through m, for m2; then 10, the 
number of j through c, for m3; m, has to lie in a \ through j, and there are 3 
such A, but each contains two further m, vertices of a p in \, that are not 
conjugate to any of m, m2, m;. Thus there are 6 choices for m,. Denote, for 
the moment, the vertex of p that is not chosen by m’; it is not then eligible 
to furnish another column of M, since there must be no linear dependence 
between the 6 columns. On each of 


m2mM3, msm, m\M2, m m4, m2mM4, m3m,4 


there is a p, and this has to be (see the remark towards the end of § 3) con- 
jugate to each of the m yet to be chosen. The six p are coplanar, constituting 
(§ 12) with m’ the tangent plane to p at m’; their plane h has for its polar an 
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f wherein the 2 g meet at m’. The polar c of } lies in f, and the m thereon are 
not eligible; the others, one on each g, are, and complete the tally. Hence 
the number of M is 


27.16.10.6.2 = 51840. 


It is true that the classical properties of the 27 lines on a cubic surfacealso afford 
a means of obtaining this number, and have long done so; the choice for m, 
after m,, m2, m; are fixed is analogous to the choice, in Schlafli’s notation, of 
say, @, Or Css towards filling half a double-six once a@,, a2, a; are fixed. But for 
corresponding “‘orthogonal’’ groups in other numbers of variables there is 
no such precedent. 


20. The 6-rowed permutation matrices x constitute, 2.1 being symmetric 
in all 6 co-ordinates, a subgroup -“% of G. Each x has certain latent column 
vectors; the nature of the space o of whose points these are co-ordinate 
vectors depends on the cyclic decomposition of the permutation, and so on 
a partition of 6. For example: the partition 3* corresponds to the cyclic 
decomposition (xxx) (xxx) and so to a o with equations of the type 


Xo = X = Xz, Xe = X= Xs; 


this is a line s. For a partition A,A2..., ¢ has dimension 5 — Z(A, — 1). No 
two among the 11 partitions yield the same type of space for ¢, so that the 
x provide representatives of 11 conjugate sets in G. 


21. If «x is given, one can find the number of matrices conjugate thereto 
in G by calculating the order of the normalizer. One writes down the most 
general matrix, with all its elements in F, that commutes with 7, and then 
imposes all the conditions on its columns that are necessary for it to belong 
to G. If N is the number of such matrices the number of operations in G that 
are conjugate to mw is 51840/N. As an example take 


answering to 3?. Any matrix that commutes with # has the form 
Q@, G2 G3 Cy Co 
Q3 @, Ge C3 Ci Ce 
G2 G3 @ C2 C3 C 
91.1 2 3 1 2 3 1 
b, be b; d; d, d; 
b; by b» d; d, d, 
js bs b; dz d; d; 
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If a; + a2 + a3 = 5; + b2 + 5; = O, this is singular. Thus 
either a, + de + a3 = 5; + 5. + 8; = 1, 
when it is necessary, for 21.1 to belong to G, that 
G23 + 301 + A102 + bobs + babi + Bid. = 1, 
or one of a; + a2 + a; and 5; + be + 4; is 0 and the other 1, when 
A203 + G30, + A102 = bobs + bab, + bide = 1. 
It is not possible to satisfy simultaneously the two conditions 
yetoxrtuey=xt+yt+2=0; 


they represent a conic and the line in its plane that is skew to it. This enables 
one to tabulate the whole set of solutions of the above conditions. The out- 
come is, since c, d are subject to precisely the same restrictions as a, b, that the 
first 3, as the last 3, columns of 21.1 must be one of the blocks 


l = a 1 | 1 1 1 
] : 2B 9 l l am 
boll De e49 1 | 2 3 

:-.4 l = 7. : 3 

= oe oe Te 1 l 

= a l a ss 


or else be derived therefrom by cyclic permutation of its 3 columns. The 
order of the normalizer of z is therefore, since the same block must not occur 
twice, 4.3.3? = 108, and z belongs to a conjugate set of 480 matrices in G. 

The above 12 columns are co-ordinate vectors, of points outside 
Xo + xy + Xe. = X3 + x4 + X5 = 0, no two of which are conjugate. 


22. There is another way of finding how many operations compose a con- 
jugate set in G once the space o constituted by the invariant points is known. 
One first writes down the most general matrix, with all its elements in F, 
having all the points of ¢ among its latent column vectors: this is secured by 
stipulating the invariance of a set of points that span ¢. One next imposes 
the restrictions on the columns of this matrix to ensure that it belongs to G. 
One has then to exclude any matrices that leave invariant not merely the 
points of ¢@ itself but also those of some space of higher dimension that con- 
tains o; this last stage is liable to be the more complicated the lower the 
dimension of ¢. As an example, let ¢ be a plane h. This does not happen for 
the permutation matrices, but it illustrates the points at issue equally well 
and provides another conjugate set. Take, for h, 


Xo =X = Xe + X3 + X4 + x5 = 0. 
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vector of an m, 





(a + 1)(6 + 1) 








A 1B, + A 2B, 


These allow A, 


A, = B, = 0, 
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Every point therein is invariant if 3 linearly independent points are; hence 
one requires a matrix having 


among its latent column vectors, and so having its last 4 columns of the 


‘ 
d 


e+ 1 
f 


Since each, as the co-ordinate vector of a point, must be conjugate to the 


sum of any two others, c f; and since each must be the co-ordinate 


One now has to prefix 2 columns, each conjugate to the sum of any 2 of the 
+; thus there ensues 


The conditions on the first two columns are 


= B, = 0; but this is inadmissible since it 
permits every point of the space x9 + x; = x2 + x3 +%*,+%5 = 0 to be 
invariant. Hence 

As 


Next, the first and third columns would be conjugate unless a = a, the 
second and third unless 8 = 5; moreover a, 8 must be unequal, for otherwise 
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the ¥ xo + x1 = X2 + X3 + x4 + x5 = 0 would have all its points invariant. 
Since a, 6 are thus unequal, 22.1 gives c = 1, and the matrix becomes 





‘ 1 a a a a 

l a+ il a+l1 a+ 1 a+ 1 
a a+l1 ; l 1 l 
a a+ 1 1 ; l 1 
a a+l1 l l l 
a a+ 1 l l 1 





Hence there are, corresponding to a = 0 and a = 1, two matrices in G leaving 
invariant the points of a given h and no point outside this plane. That they 
are conjugate in G follows because each is obtainable from the other by 
simultaneous transpositions of the first two rows and of the first two columns, 
and so by transformation by a permutation matrix. Each is inverse to the 
other, and both have for their square an involution for which every point 
in the ¥ xo + x1 = x2 + x3 + x% + x5 = 0 is invariant. They are representative 
of a conjugate set of 540 operations of period 4 whose invariant points consist 
of 6 p and an m, a fact that must be mirrored in other representations of G. 


23. The following list shows, for each partition of 6, the type of equations 
for the space o of invariant points of the corresponding permutation matrices; 
it also says what kind of space o@ is, gives the Roman numeral that labels 
the corresponding conjugate set in (6) and (5), and the number of operations 
of G in this set. Several facts that are evident from this list have to agree 
with known results; the period of an operation can be seen from the partition, 
as can the nature of the different powers. For instance: since 6 has 3? for its 
square and 2° for its cube, the operations labelled XXIII must have their 
squares labelled 1X and their cubes labelled XVII. Moreover, the numbers 
of m and p that are invariant for a given operation, being those of points in 
its invariant space, are known and must be the same as numbers already 
encountered in other representations of G. 





TABLE III 
16 [5] I 1 
142 xX; = Xe P XVI 36 
133 x, = Xe = X3 K VI 240 
1%4 X) = Xe = Xp = X% f XVIII 1620 
17922 x, = X2, X3 = X% y Il 270 
123 x, = Xo, X%3 = X% = Xs j XXI 1440 
15 x) =X. =X; = Xy = Xs c XV 5184 
6 Xo =X =X. = Xs = X%y = Xs p XXIII 4320 
24 Xo =X, Xe = Xo = Xy = Xs t V 3240 
23 Xo = Xy, Xe = Xs, My = Xs e XVII 540 
32 Xo = X = Xe, Xo = Xy = Xs s IX 480 
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24. Not every type of subspace in [5] has appeared in the role of ¢. Some 
cannot. Were A invariant so would its polar c be, and on ¢ lies a single p 
which would be invariant too. Thus the hypothesis that every point of J is 
invariant, implying the invariance of a p outside A, implies the invariance 
of every point of the join P of \ and »p. Nor, as is easily seen by taking a 
face of Zo, can every point of an M be invariant. It has, however, been seen 
that o might be 4; this gives the set XIX. Furthermore, every point of 


X: Xo +X = Xo + Xs = Xa + Xs 


is invariant for the involution 





. «- 2. 2 1| 
» &2 BOR ES 
,s 1 1 
es > 2S 2 
aa 

> o-. 1 





and for no other operation of G save I. This involution is one of 45 in the 
set III. 


25. The 36 involutions J associated one with each p are conspicuous as 
being the only operations in G whose invariant points fill a [4]. Suppose 
that p:, P2 are such that J;J2 = J2J;. If m is outside both P; and P:, and 
if m, and m2, the remaining points on mp, and mp, respectively, are those 
points into which m is transformed by J; and J2, commutation demands that 
the intersection m2 of myp2 and mzp, is on Z% The quadrangle mm,m ym, has 
1, P2 for diagonal points; since F has characteristic 2 the third diagonal point 
is the remaining point of p:p2. Now any plane that includes 4 m has to be an 
f and include 5; hence pip is a t. Conversely, let pipe be a t, » its polar solid; 
as pi, p2 are conjugate P; and P; both contain y. Every point in y is invariant 
for both J; and J2; the 16 points of P; outside y, collinear in pairs with ps, 
are transposed in pairs by J2; the 16 points of P: outside y are transposed 
in pairs by J:; hence the successive application of J;, J. has the same effect, 
in either order, on any point in either P; or P,. Any point of [5] that is outside 
both P; and P: is on lines that meet them both, and is the only point on any 
such line, it follows that it, too, is changed into the same point by successive 
application of J;, Je, in either order. Thus JiJ2 = J2J,. 

This discussion has shown that J, J. do or do not commute according as 
pip is a t or an s. Thus non-commuting J occur (6, p. 86) in associated sets 
of 3. The product of any number of mutually commutative J is itself an 
involution; each factor is associated with a p and every pair among these p 
has its join a ¢. In particular: the 4 points in an e not on the g therein have 
this property; the product of the 4 corresponding J is the involution whose 
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invariant points compose the polar x of g. As there are 3 e through g in x 
there are (6, p. 88) three ways of expressing this same involution as a product 
of 4 commutative J. 

The conjugate sets in Table III are identified by representatives that impose 
certain permutations on the vertices of 2». The mutually disjoint cycles that 
compose the permutation correspond, when appropriately expressed as pro- 
ducts of transpositions, to those r-chains used by Frame that have r < 6. This 
is why the same 11 partition labels have appeared among the 18 at the 
bottom of p. 91 of (6). 


26. G is a subgroup, of index 28, of the “group of the bitangents,’’ and 
this larger group happens too to have a representation in 6 variables over F, 
being isomorphic to the symplectic group of matrices wu for which »’Syu = S, 
when S is a non-singular skew matrix. A referee has proposed that this occur- 
rence as a subgroup of the symplectic group be shown in the present context, 
and so this § 26 is appended. 

Any projectivity that leaves /% invariant turns conjugate points into 
conjugate points and non-conjugate points into non-conjugate points; hence, 
with / in the symmetric form of § 2, the bilinear form yz, must be 
unaltered under each of the 51840 projectivities. But this bilinear form is 
y'Bz, z being the column vector whose 6 components are the z,, y’ the row 
vector with components y,, and B the non-singular 6-rowed matrix mentioned 
at the end of § 6. But B, symmetric and with a zero diagonal, is skew over 
F, so that all 51840 M belong to the symplectic group I on B. 

The condition that M belong to [T is less stringent than the condition 
that M belong to G; for the symplectic property is not that a quadric but 
that a polarity is left invariant and, as was noted in § 5, the same polarity 
arises from 64 quadrics. The wider latitude of the symplectic condition permits 
these quadrics to undergo permutation. Yet a ruled quadric can be permuted 
only with other ruled quadrics, and likewise for quadrics, such as % that 
are not ruled. One presumes, knowing the index of G in I, that 28 of the 64 
quadrics are not ruled while the remaining 36 are ruled. 

All 64 quadrics, inducing as they do a non-degenerate polarity, are non- 
singular; in order that one be ruled it is enough for it to contain a single 
plane. If A is a point on the quadric, any plane thereon through A has to 
lie in the null prime of A. When 7 has the form 2.1 and B corresponds thereto 
the null prime of (1, 1,1, 1,1, 1) is 


Xo + Xi + Xo + x3 +44 + x5 = 0, 


and the conditions for 


5 
26.1 > axit+ d xx, =0 
t=—0 


i<j 
to contain the plane x» + x; = x2 + x3; = x4 + x5 = O are 


ao + a) = 2 +03 =~ 4+ a5 = 1. 
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Thus 3 squares must be absent from 26.1, 3 present. Again: the null prime 
of Xo (see § 2) is 


Xo = Xo +X + Xo + Xs + X4 + Xs, 


and the conditions for 26.1 to contain the plane x; + x2 = x3 + x4 = x5 = 0 
are 
ay = 0, a4, + d2 >a; +a, = 1. 


The mark a; can be either 0 or 1; taking as = 0 shows 26.1 to be ruled when 
2 squares are present, 4 absent. So 20 + 15 = 35 ruled quadrics are accounted 
for. 

The ruling would, of course, be patent if 26.1 could be thrown into Pliicker’s 
form. For example: if all 6 squares are present 26.1 is 


(xo + X1 + X4) (Xo +X) + 2X5) + (Xo + Xs + Xo) (Xo + X3 + 2X) 
+ (X%q + Xs + Xo) (X4 + X5 + X33) = O. 


REFERENCES 


1. H. F. Baker, Principles of geometry, vol. 4 (Cambridge, 1940). 

— A locus with 25920 linear self-transformations (Cambridge, 1946). 

3. G. Castelnuovo, Ricerche di geometria della retta nello spasio a quattro dimensioni, Atti del 
reale Istituto Veneto (7), 2 (1891), 855-901. 

4. W. L. Edge, The geometry of the linear fractional group LF(4,2), Proc. Lond. Math. Soc. 
(3), 4 (1954) 317-42. 

5. ———— The conjugate classes of the cubic surface group in an orthogonal representation, 
Proc. Roy. Soc. London, A 233 (1955), 126-46. 

6. J. S. Frame, The classes and representations of the groups of 27 lines and 28 bitangents, Ann. 
Mat. (4), 82 (1951), 83-119. 

7. C. M. Hamill, A collineation group of order 2".35.5*.7, Proc. Lond. Math. Soc. (3), 3 (1953), 
54-79. 

8. E. Study, Gruppen sweiseitiger Kollineationen, Nachrichten von der Kéniglichen Gesell- 
schaft der Wissenschaften zu Géttingen (Math.-phys. Klasse) (1912), 453-79 

9. B. L. van der Waerden, Gruppen von linearen Transformationen (Berlin, 1935). 


~ 


University of Edinburgh 











POLYTOPES OVER GF(2) AND THEIR RELEVANCE 
FOR THE CUBIC SURFACE GROUP 


H. S. M. COXETER 


1. Introduction. In the preceding paper, Edge represented the cele- 
brated “cubic surface group” of order 72.6! = 51840 as the group of auto- 
morphisms of a senary quadratic form over the field of residue-classes mod 2. 
The object of this sequel is to compare Edge’s finite space with a real space, 
thus identifying his non-ruled quadric in PG(5, 2) with a modular counter- 
part of the semi-regular polytope 22; which was discovered by Gosset in 1897. 


2. The automorphisms of a real quadratic form. The positive definite 
senary quadratic form 
Es = (x')? = xx? oo (x?)? = x 2x3 a (x*)? _ xix4 a (x*)? —_ xix (x5)? 
—_ x36 (x*)? 


is conveniently symbolized by the “‘graph”’ 


—— 


in which the nodes represent the “‘square’”’ terms while the branches represent 
the “‘product” terms (4, pp. 192, 297). The form evidently possesses six in- 
volutory automorphisms (or ‘‘automorphs’’) R,,..., Rs, defined as follows. 
Suppose the kth node of the graph is joined to the ith, jth, etc. Then R, leaves 
all the x’s unchanged except x*, which it transforms into 


—x*+x'+x/+...; 
for instance, R; is 
(x!, x7, x3, x4, x5, x8) — (x), x?, x? — x? + xt + x8, x4, x5, x). 
The six R’s generate a group of order 51840 which is the whole group of auto- 
morphisms (omitting the trivial automorphism that replaces every x‘ by 
—x"‘) (5, p. 422). To identify this with the group of the 27 lines on the general 


cubic surface, we observe that its generators satisfy the relations R,? = 1 and 
either 


R:R;R; = R,R.R; or RR; = R;R, 
according as the ith and jth nodes of the graph are, or are not, joined. These 
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same relations are satisfied by certain permutations of the 27 lines on the 
cubic surface (3, p. 458), namely: 


R; = (1 2) = ~ bi Cos Coa Cos — 


Qo be Cis Cra Cis Crs 


R; = (2 3), Rs = (3 4), Re = (4 5), Rs = (5 6), 


Cos Cis Cig Gq Ap Me 
r= ( ). 


by be bs Cee Cas Cas 
The Diophantine equation 
Es = 1 


has 36 pairs of solutions, which are easily derived from any one of them, 
say (1, 0, 0, 0, 0, 0) or 


10000 
0 ’ 
by applying the antomorphisms; for example, by applying R2, Rs, Ra, Rs, 
Rs successively we obtain 


11000 =11100 411110 41111 IL1111 
0 0 0 0 l 


When regarded as points in an affine 6-space with a Euclidean metric given 
by the form E¢, these 72 solutions are the vertices of Elte’s six-dimensional 
uniform polytope ly. (1, p. 372; 2, p. 472; 3, p. 470; 4, p. 210; 5, p. 414). 


3. The transition from the real field to the field with two elements. 
Most of the above remarks remain valid if we regard the x’s as marks of the 
field GF(2), that is, if we work in the finite affine 6-space EG(6, 2). Since 
—1 = 1, the minus signs in Es, can now be replaced by plus signs, and the 
description of the generating automorphisms becomes still simpler: R, “‘in- 
creases” the co-ordinate x* by the sum of its (one, two, or three) neighbours. 
Another simplification is that the 36 pairs of opposite solutions of Es = 1 
become 36 single solutions which we can identify with the 36 vertices of an 
“elliptic” 122, or with the 36 double-sixes on the cubic surface. 

Since GF (2) has only the two elements 0, 1, every point of EG(6, 2) that 
does not satisfy E, = 1 must satisfy Es = 0. One such point is the origin. 
The remaining 27 are the vertices of Gosset’s six-dimensional polytope 22, 
(3, p. 466; 4, p. 202) for which it is natural to use the same symbols that 
Schlafli invented for the 27 lines. In fact, the point 


01000 
1 


must be a, since it is invariant under all the generating automorphisms 
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except R,; and by applying these automorphisms in an appropriate order, 
we obtain the whole set of 27 as follows: 


a, ae a3 as a5 
01000 11000 10000 10100 10110 
1 l l l l 
as6 C56 C46 C26 
10111 10100 10110 10010 
l 0 0 0 
C26 C16 Cis C14 C13 
11010 01010 01011 01001 01101 
0 0 0 0 0 
C12 C23 C24 C25 
00101 11101 11001 11011 
0 0 0 0 
C35 C45 C34 by be 
10011 10111 10001 11101 01101 
0 0 0 l l 
b; bs bs be 
00101 00001 00011 00010 
l l l l 


By regarding the six x’s as homogeneous co-ordinates, we thus obtain a 
partition of the 2° — 1 = 63 points of the finite projective 5-space PG(5, 2) 
into 27 which lie on the quadric E, = 0 and 36 which do not. In other words: 

In PG(5, 2), a non-ruled quadric is a polytope 22, and the rest of the space 
is the “‘semi-rectprocal”’ polytope 1.2. 


4. The polytope 2,;. It is known (1, p. 414; 3, p. 465; 4, pp. 202, 203) 
that the uniform polytope 2.2; has 27 vertices 
Gio cccs Gat Ge. sce Gems << em 


216 edges such as @,;42; and 720 triangular faces such as a;@2@3. Referring to 
Table II of Edge (6, p. 643), we readily identify these with his 27 points m, 
216 lines c, and 720 planes j. 

The polytope has five-dimensional cells of two kinds: 36 pairs of opposite 
simplexes 22 = as such as 


@, G2 G3 G4 Asp Ae, bi be bs by bs be 
and 27 cross-polytopes 2;, = 8; such as 


a a2 a3 a4 ds 
Cw C26 C36 C46 Coe. 
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Edge (6, p. 626) has made the important observation that, when any one 
of the 72 simplexes a; is used as simplex of reference, the quadric has the 
symmetrical equation 

> xx, = 0. 


i<j 


When the 6 + 6 vertices of a pair of opposite simplexes have been removed, 
the remaining 15 vertices belong to a polytope 03; = tas (1, pp. 360, 372) 
such as 


Ci2 Cig...» Coe, 
lying in one of Edge’s 36 hyperplanes P. 
Other central sections of 22; include 45 triangles such as 
a, be Ciz 


(Edge’s 45 lines g), which correspond to the 45 triangles on the cubic surface, 
and 120 triangular double-prisms {3} & {3} (3, pp. 463, 466) such as 


C23 Cis C46 
ade b, Ci2 
b; as Cas 


(Edge’s 120 spaces k), which correspond to the 120 pairs of Steiner trihedra. 
The latter fall interestingly into 40 sets of three (3, p. 466). 


5. The polytope 1... The “elliptic’’ 1.2, which differs from the ordinary 
le» (1, p. 414) in that pairs of opposite elements are identified, has 36 vertices. 
In PG(5, 2), these are Edge’s 36 points p. Each is the centre from which two 
opposite simplexes as; of 22, are in perspective; for example, the simplexes 
@; Gz G3 Ay As Ag and by de bs by bs be Of § 4 are evidently in perspective from 
the vertex 

10101 
0 
of Los. 

The principal central sections of the ordinary 122 are given in Table IV of 
(2, p. 476). These, with their numbers inserted, are as follows: 


36 «{ } 36 eas 

120 {6} 120 {6} + {6} 
270 {3\ | 270 {4} 

45 {3, 4, 3} 

216 Cals 

27 tiBs 


The “36 { }” are merely the diagonals joining pairs of opposite vertices, 
and so correspond to Edge’s 36 points ». The 120 hexagons (which reduce to 
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triangles when opposites are identified) are his 120 lines s. The 270 cubocta- 
hedra (which reduce to complete quadrilaterals) are his 270 planes h. The 
45 {3, 4, 3} (which reduce to desmic triads of tetrahedra) are his 45 spaces 
x. The 216 ea, (which reduce to complete pentahedra) are his 216 spaces X. 
The 27 4,8; (1, pp. 354-360) are his 27 spaces M. The 36 eas (1, p. 366) are his 
36 spaces P. The 120 pairs of hexagons in completely orthogonal planes are 
his 120 spaces x. Finally, the 270 squares (which reduce to point pairs) are 


his 270 lines ¢. 
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ON THE STRUCTURE OF HALF-GROUPS 
JAMES V. WHITTAKER 


1. Introduction. Furstenberg (1) and the author (4), among others, 
have exhibited postulate systems for groups in terms of the operation 
x —y=x+ (-—y). Furstenberg also investigated a system obtained by 
removing one of his postulates which defines what he called a half-group. A 
structure theorem for half-groups was given in (1). In the present paper, we 
prove another structure theorem for half-groups. A more restricted entity, 
called a pseudo-group, is introduced, and its structure, together with that of a 
half-group satisfying the left and right cancellation laws, is studied. Finally, 
some topological questions concerning their structure are also considered. 


2. Pseudo-groups. Let G be a set of elements over which is defined a 
binary operation x — y which satisfies, for any x,y,z € G, the following 
conditions: 

(i) x -y €G, 

(ii) There is an e € G such that x — y = e if, and only if, x = y, 

(ili) (29 — 2) — (y—2) =x-y. 

It was shown in (4) that, in terms of the operation x + y = x — (e — y), G 
is a group. It was also shown that if G satisfies (i), (ii), and 

(iv) (x — 2) — (x —y) =y —2, 
then G is a commutative group. 

We define a half-group to be a set with an operation satisfying (i) and 
(iii). A half-group which also has the property 

(v) There is an e € G such that x — x = e 
will be called a pseudo-group. Before proceeding to a discussion of half-groups, 
we shall consider briefly the structure of pseudo-groups. 

We define a set S to be an extension of a group G if G C Sand if there exists 
a single-valued function f defined on S such that f(x) € G for all x € S and 
f(u) = u for all u € G. The set S becomes a pseudo-group under the operation 


(1) xoy = f(x) — f(y), x,y €S. 
Indeed, xo y € GCS, and xox = f(x) — f(x) = e. Finally, 
(x02) 0 (yo) = f(f(x) — f(z)) —f¥) — f)) 
= (f(x) — f(z)) — YO) —f@)) =f) —fO) = xo», 
and (i), (iii), and (v) are satisfied. 


THEOREM 1. Every pseudo-group S contains a group G such that S is an 
extension of G and the operation in S satisfies (1). 
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Proof. Let x — y be the operation in S, and set f(x) = x — e. We will 
show that S is an extension of the group G = f(S). For any x, y € S, we have 
(x — y) —e= (x — y) —-(y-—y) = x-y. 

Setting y = e, we obtain f(f(x)) = f(x), whence f(u) = wu for all u € G. It 
also follows that if u,v € G, then u — v = f(u — v) € G, and G satisfies 
(i). Since (iii) and (v) hold in S, they held also in G. To verify the remaining 
half of (ii), we observe that if u,v € G and u — v = e, then, since f(u) = u 
and f(v) = », 

u=u—e= (u—v) — (e—v) =e — (e — v) 
=(v—v) —(e—v) =v—e=-v. 


Hence, G is a group. Finally, it follows from (iii) that, for all x, y € S, 
x — y = f(x) — f(y), and the proof is complete. 

In any pseudo-group S, we may introduce the sum x + y = x — (e — y) 
for all x, y € S. Then 


x+y =f(x) —fle — vy) = f(x) — (e — f(y)) = f(x) + f(y). 
Hence, this operation is associative. A pseudo-group is said to be commu- 
tative ifx+y=y+x. 
THEOREM 2. Jf S is a set with an operation satisfying (i) and (iv), then S is 
a commutative pseudo-group. 
Proof. For any x, y,z € S, we have 
(x — 2) — (y — 2) = ((2 — 2) — (2 — x)) — (2 — 2) — (2 — y)) 
=(z—y) — (2 —x*) =x-Yy, 
which proves (iii). Next, x — x = (x — y) — (x — y) = y — y = e. Since the 
group G = S — e also satisfies (iv), G is commutative. From the remark 
preceding Theorem 2, it follows that x + y = y + x for all x,y € S. 
In terms of subtraction, the left and right cancellation laws take the 
following forms: for all x, y,z € S, 
(vi) x — y = x — 2 implies y = z, 
(vii) x — 2 = y — g implies x = y. 
The next theorem shows, incidentally, that (vi) implies (vii) in a half-group. 
That (vii) does not imply (vi) can be seen by considering what we shall call 
the simple half-group, a set in which x — y is defined to be x. The structure 
of half-groups satisfying (vii) will be taken up in the next section. 
THEOREM 3. A half-group in which the left cancellation law holds is a group. 
Proof. Suppose that S satisfies (i), (iii), and (vi). For any u, x, y,2 € S, 
we have 
(u — x) —2 = ((u — x) — (x — x)) — (2 — (x — x)) 
= (u— x) — (2 — (x — x)). 
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Hence, 2 — (x —x) =z2=2—(y—y), andx—-x=y—y=elIfx-—y 
=e=x— x, then x = y and (ii) is satisfied. Therefore, G is a group. 


3. Half-groups. Following Furstenberg (1), we introduce the following 
definitions. We call an element a of a half-group H an idempotent if a — a = a. 
A subset K C H is called invariant in H if x — (x — K) C K for all x € H. 
If K is a sub-half-group of H which contains all idempotents in H, then the 
relation x = y defined by x — y € K is an equivalence relation. In fact, 
x = x since x — x isan idempotent. If x = zand y = z, thenx — y = (x — 2) 
— (y — 2) € K, and x = y. The resulting quotient space will be denoted by 
H/K. A weaker form of the following theorem was proved in (1, Theorem 4). 


THEOREM 4. In the half-group H, let K be an invariant sub-half-group con- 
taining all idempotents of H. Then H/K is a group. 


Proof. We refer to (1) for the proof that if u,v, x,y € Handu =x, = y, 
then u —v =x — y. Thus, if x* and y* are the equivalence classes con- 
taining x and y, respectively, then (x — y)* = x* — y*. Clearly H/K satisfies 
(i) and (iii). If e* is the equivalence class containing K, then x* — x* = é*, 
since x — x is an idempotent. Finally, if x* — y* = e*, then x* = y*, and (ii) 
is satisfied. Hence, H/K is a group. 

Let G be a group and {S,}a.4 a family of disjoint sets, indexed by the 
set A, with the following properties: 

(a) For each a € A, there exists a single-valued function f, mapping S, 
onto G, 

(b) For each a, 8 € A, there exists a one-to-one function ¢.43 mapping G 
into S, such that fa(¢ee(g)) = g for each g € G. 

We now define an operation xo y in H = S, as follows: if x € S, and 
y © Ss, then 


(2) XOY = Gas(fa(x) — fa(y)). 


We note that xo y € S,. Under this operation, H becomes a half-group. 
Since (i) is clearly satisfied, we have only to verify (iii). If 2 € S,, we have 


(x ° Zz) ° (y ° 2) das | fa (Pay (fa(x) = fy(2))) —_ fa(dpy(fo(y) — fy(2)))} 
dap((fa(x) — fy(z)) — (fey) — f,(2))) 


dap (fa(x) — faly)) = xoy. 


Il 


II 


Each S, becomes a pseudo-group in which ¢eefe plays the role of f in the 
representation of pseudo-groups given in § 2. If e is the identity of G, then 
daa(€) = € is the identity in S,, since xox = e, for all x € S,. The idem- 
potents in H are precisely the elements e, for all a € A. It follows from (2) 
that as(fa(x)) = x o eg for all x € S,. In particular, S, o é. is isomorphic to G. 


- 


THEOREM 5. Every half-group H can be represented as the union of disjoint 
pseudo-groups in which the operation is defined by (2). 
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Proof. Let E = {ealaca be the set of idempotents of H indexed by the 
set A, and, for each a € A, let S, = {x € H:x —x =e,}. If x € S, and 
y € H, then clearly x — y € S,, and S, satisfies (i). Since (iii) holds in H, it 
holds also in S,. Finally, S, satisfies (v) with e = e4, so that S, is a pseudo- 
group. In addition, a ~ 8 implies S, (\ Ss = 9, and H = U S,. 

We turn now to the construction of G and the functions appearing in (2). 
For any x, y € H and a € E, we have 


(x — a) — y = ((x — a) — (a — a)) — (y — (2 — )) 
(x — a) — (y-—a) = x-y. 


Similarly, if 6 € EZ, then x — (y — 6) = x — y. Combining these relations, 
we obtain 
(3) (x — a) — (y— 6) =x—-y. 


(For the sake of compactness, we shall also let (3) stand for the two preceding 
relations.) If we set F= E— E = {u —v:u,v € E}, then F is a sub-half- 
group by virtue of (3). Repeated application of (3) also shows that x — (x — 2) 
=x -—x € F for all x € H and z € F. Hence, by Theorem 4, H/F = G is 
a group. The corresponding equivalence relation will be written x = y, and 
the quotient mapping from H onto G will be denoted by h. 

For each a € A, let G, = S, — eg. We now define functions ¥.: H -G, 
as follows: 

Wa(x) = a — (€4 — x). 


For any x € H, we have e.g — x € S,. From Theorem 1 it follows that the 
difference of any two elements in S, lies in S, — @é, = Ga, whence Wa(x) € Gg. 
In addition, 


h(Wa(x)) = h(ea) — (hea) — h(x)) = e — (e — h(x)) = A(x), 
where e¢ is the identity of G. Hence, x = ¥.(x) and 
(4) x — Wa(x) = e, — @. 


If x € Ss, then x — ¥.(x) € Ss and e, — es € S,, whence y = 8. (Further 
calculation shows that 6 = a, but we shall not need this fact.) Thus every 
equivalence class intersects each G,. Furthermore, no two elements of G, are 
equivalent, that is, G,\ F = {ea}. In fact, if y € G, and y = e, — @&, then 
Y = ¥ — €a = (€a — €3) — Ca = Ca. Hence, the restriction of hk to G, is an 
isomorphism h, between G, and G. 

The functions f, are now defined to be the restrictions of h to S,, and the 
das are defined, for each a, 8 € A, and each g € G, by 


as(g) = hz*(g) — e. 


Since x — es = x for all x € S, because of (3), it follows that f.(das(g)) = g 
for all g € G, and conditions (a) and (b) are fulfilled. Moreover, if x € S., 
then h,~'(fa(x)) = x — éa, for each member is equivalent to x and lies in 
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G,., and such elements in G, must be identical. Similarly, if y € Ss, then 
ha~'(fa(y)) = Waly), for each member lies in G, and is equivalent to y. Finally, 
if x € S, and y € Sg, then, using (3) and (4), 

x—y= (x — v.(y)) — GY — %Y)) 
((x — €a) — Waly)) — (es — es) 
(ha* (fa(x)) — ha” (faly))) — ¢8 

= ha’ (fa(x) — fa(y)) — ¢3 = da0(fa(x) — fo(y)). 

This completes the proof. 


Coro.uary. Let K > E be an invariant sub-half-group of H. Then there 
exists an invariant subgroup N of G such that K/F = N and H/K =G/N. 


Proof. We adopt the notation of the proof of Theorem 5. Let A(K) = N. 
Then NX is clearly an invariant subgroup of G. Since the restriction of h to K 
is the canonical mapping from K onto K/F, N and K/F are isomorphic. Now 
let L = h~'(N) and k be the canonical mapping from H onto H/K. Clearly 
x —y€K implies x — y € L for all x, y € H. Conversely, if x — y € L, 
then there exists a z€ K such that x — y =2z(mod F). Since K DF, 
x — y =z (mod K) and 


k(x) — k(y) = k(x — y) = R(z) = @, 


where @ is the identity in H/K. Hence, k(x) = k(y) and x — y € K. 

Let @ be the canonical mapping from G onto G/N. For any x, y € H, the 
relation 6(h(x)) = 6(h(y)) is evidently equivalent to h(x) — h(y) € N and, 
hence, tox — y € L. Thus, the correspondence between H/K and G/ Ndefined 
by x* + 6(h(x)), where x* is the equivalence class (mod K) containing x, 
is one-to-one and onto. Finally, 


x* — y* = (x — y)* + O(h(x — y)) = O(A(x)) — O(A()), 


and the correspondence is a group isomorphism. 

Let {Ga}a«a be a family of disjoint groups, indexed by the set A, satisfying 
the following condition: 

(c) For each a, 8 € A, there exists a unique isomorphism 6.3: Gg — G,. In 
addition, for all a, 8, y € A, Oa9.0ay = ay. 

It follows from (c) that O.a(x) = x for allx € G,. We now define an operation 
xoy in H = UG, as follows: if x € G, and y € Gz, then 


(5) xOy = xX — Oag(y). 
Thus xo y € G,. If z € G,, then 


(x02) 0 (you) = (x — Ox(2)) — Oas(y — %,(2)) 
= (x — Oay(2)) — (Oap(¥) — Gay(z)) = xO. 
Hence, H is a half-group. Furthermore, ifx oz = yoz, thena = Sandx = y, 
whence H satisfies (vii). 
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We can define a second operation x @ y in H as follows: x ® y = x 0 (e,oy), 
where e, is the identity in G,. We have 


(6) x @y = x — Oay(e, — O(y)) = x — (€a — Gap(y)) = x + Oap(y). 


Thus x @ y is independent of y, and it is easily verified that this operation 
is associative. In addition, each e, is a right identity of H. 


THEOREM 6. Jf H is a half-group in which the right cancellation law holds, 
then H can be represented as the union of disjoint, isomorphic groups in which 
the operation is defined by (5). Moreover, H is a semigroup under the operation 
defined by (6). 


Proof. We adopt the notation of the proof of Theorem 5. It follows from 
(3) that, for allx € H anda € A, x — &g = (x — €a) — éa, whence, by (vii), 
x =x—e,. Thus S, = G,, and f, is one-to-one. From (b) we infer that 
das = fa' for all 8 € A. If x € G, and y € Gz, then 


x—y = fa (falx) — fo(y)) = x — fa'(fa(y)). 
If we set f.~'fg = 0.8, then the conditions of (c) are fulfilled and (5) is verified. 
The rest of the theorem follows from the discussion preceding it. 


Coro.iary. Jf H is a half-group satisfying (vii) in which the operation 
x ® y ts commutative, then H is a (commutative) group. 


Proof. lf x € G. and y € Gg, then x ® y € G, and y @ x € Gs, whence 
a=Bandx@®@y=x+y. 


4. Topological pseudo-groups. A topological pseudo-group S is a 
pseudo-group with a topology J in which x — y is continuous jointly in x 
and y. Thus G = S — e is a topological group in its relative topology. This 
definition, however, places very little restriction on the topology in the com- 
plement of G, and it will sometimes be necessary to impose the additional 
condition that the topology does not separate x and x — e: 

(viii) For every U € J and every x € S,x € U if, and only if, x —e € U. 

Let S be an extension of a topological group G with topology J(G), and let 
f be defined as in § 2. We introduce a topology J(S) into S as follows: 


(7) J(S) = {f'(U): U € J(@)}. 


With this topology and the operation defined in (1), S becomes a topological 
pseudo-group. To show that subtraction is continuous, let x,y € S and 
xoy € W, where W € J(S). Then there exist sets Uo, Vo € J(G) such that 
f(x) € Uo, fly) € Vo, and Us — Vo Cf(W). Hence, x € U = f-(Up), 
y€ V=f'(Vo), and Uo VC W. Since f(x) = x oe, (viii) is also satisfied. 


THEOREM 7. Every topological pseudo-group S which satisfies (viii) can be 
represented as an extension of a topological group in which the operation is 
defined by (1) and the topology by (7). 
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Proof. Let G = S — e and f(x) = x — e for x € S. Then G is a topological 
group in its relative topology, and f is continuous on S. From (viii) it follows 
that if U € J(S), then f-'(U (\ G) = U, which verifies (7): The rest follows 
from Theorem 1. 

In each of the following corollaries, S denotes a topological pseudo-group 
satisfying (viii). We define S to be 7, (i = 1, 2) if S — e is 7, in its relative 
topology. A topological space is 7, if each of its points is closed, and 7, if 
every two distinct points have disjoint neighbourhoods. A space is first 
countable if every point has a countable base of neighbourhoods. 


Coro.iary 1. Jf C CS is closed, and x € S is such thatx—e ¢C—e, 
then there exists a continuous o: S — [0,1] such that (x) = 0 and $(y) = 1 
fory € C. 


Proof. Since f(x) ¢f(C) and f(C) is closed in G, there exists (3, p. 188) 
a continuous ¢o: G — [0, 1] such that ¢o(f/(x)) = 0 and ¢@ = (f(y)) = 1 for 
y € C. Hence, @ = ¢of has the desired properties. 


Coro.iary 2. Jf S is first countable, then S is pseudo-metrizable with a 
right-invariant pseudo-metric. 


Proof. Since G is first countable in its relative topology, it follows from (3, 
p. 210) that G is pseudo-metrizable with a right-invariant pseudo-metric 
r(x, y). Then s(x, y) = r(f(x), f(y)) is the desired pseudo-metrix for S. Since 
f(x — z) = f(x) — f(z), s is also right-invariant. 


Coro.iary 3. If S is T, and locally compact, then there exists a right-invariant 
Haar measure yu in S. 


Proof. Siuce G is T; and locally compact, there exists a right-invariant 
Haar measure yo on the Borel sets of G (cf. 2, p. 254). Each Borel set BC S 
is evidently of the form f~'(Bo) where By is a Borel set relative to G. Hence, 
u(B) = wo(f(B)), and the right-invariance follows as in the proof of Corollary 2. 

To show the necessity for some additional assumption such as (viii) in the 
preceding corollaries, we give the following example, based on one in (1), in 
which S is a topological pseudo-group which does not satisfy (viii). In addition, 
S is compact, connected, first countable, and 7;, but not 72, and hence, not 
pseudo-metrizable. Let S be the interval [0, 2], and let the operation in S be 
defined as subtraction (mod 1). Then S is evidently a commutative pseudo- 
group, and G = [0, 1). The neighbourhoods of the points in (0, 2] are those 
of the relative Euclidean topology, while the neighbourhoods of 0 are of the 
form [0, p) LU (¢, 1), where p, o € (0, 1). Evidently S is a topological pseudo- 
group, and the other properties of S are easily verified. In particular, 0 and | 
have no disjoint neighbourhoods. 


5. Topological half-groups. A topological half-group H is defined as 
a half-group with a topology in which x — y is jointly continuous in x and y. 
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For the theorems in this section, we do not need as strong a condition as 
(viii), and we shall require only the following: 
(ix) Forevery U € J(H) and every x € H,x € Uimpliesx — (x — x) €U. 
The notation of the proof of Theorem 5 will be adhered to throughout this 
section. From the foregoing definition, we see that each G, is a (not necessarily 
T,) topological group in its relative topology, and each S, is a topological 
pseudo-group. 


THEOREM 8. Let H be a topological half-group satisfying (ix) and K > Ean 
invariant sub-half-group of H. Then the quotient mapping from H onto H/K 
becomes an open, continuous mapping when H/K is given the quotient topology, 
and H/K becomes a topological group. 


Proof. We first consider the case K = F and show that, for any open 
UCH, U* =" (A(U)) is open. Let V = Un eada*(U). Since y is con- 
tinuous on H and y¥,(x) = x for all x € H, Visopenand VC U*. If x € U*, 
then x = u for some u € U. Now u € S, for some a € A, andu=u—-e 
=u— (u — u) € G,C\ U in view of (ix). Thus, we can assume that u € G,. 
Then ¥.(x) = va(u) = u, and since W(x) € Gz, we have ¥.(x) = u. Hence, 
x € Vand U* CV, so that U* = V is open. It follows immediately that h 
is continuous and open, and G is a topological group in the quotient topology. 

Now let K > E bean arbitrary invariant sub-half-group of H, k the quotient 
mapping from H onto H/K, and @ the quotient mapping from G onto G/N, 
where V = h(K). From the corollary to Theorem 5, it follows that, for each 
x € H, x* = k“(k(x)) = h-'(@-'(0(h(x)))). Since h# and @ are each open, 
continuous mappings, we conclude that if U C Hisopen, then U* = k-'(k(U)) 
is open. Hence, & is open and continuous, and H/K is a topological group in 
its quotient topology. 

A topological half-group H will be said to be 7, (4 = 1,2), provided 
r = UG, is T, in its relative topology. If H satisfies (ix), then each A, is a 
topological, as well as an algebraic, isomorphism between G, in its relative 
topology and G. The continuity of 4, follows directly from Theorem 8. Let 
U CH be open. Since the range of y, is G. and ¥.(x) = <x for all x € H, 
each x € ¥.~'(U) is congruent (mod F) to some element in U (\G,. More- 
over, a(x) = x for each x€G,, so that ¥.'(U) D UN\G,. Hence, 
h(U (\ Gz) = h(¥a'(U)) is open, and h, is open. Thus, if H is T;, then G 
is also T, 

Consider now the set G X E and the operation defined in it as follows: 


(gi, €a) © (ge, es) = (g1 — B2, €a)- 


Under this operation, G X E evidently becomes a half-group satisfying (vii) 
(cf. the discussion preceding Theorem 6). If we introduce the product topology 
in G X E derived from the quotient topology in G and the relative topology 
from H in E, then G X E becomes a topological half-group. In fact, if Ui K Vi 
and U; X V2 are open in G X E, then (Ui X Vi) o (U2 X V2) = (Ui — U2, Vi. 
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Let ¢: H —~G X E be defined as follows: if x € S,, then @(x) = (h(x), e.). 
Since, for any y € H, x — y € S,, we have 


o(x — y) = (A(x) — h(y), ea) = (x) 0 O(y). 


The restriction of @ to [ is clearly one-to-one and onto, but, in general, 
x — y # $'(o(x) o o(y)) for x,y € IL. 


THEOREM 9. If H is a topological half-group satisfying (ix), then @ is a con- 
tinuous, open homomorphism of H onto G X E. 


Proof. It remains to prove that ¢ is continuous and open. Let w(x) =x—-x 
for all x € H. Then w is continuous on H, w(S,) = e. for each a € A, and 
(x) = (h(x), w(x)). If UK V CG ®X E is open, then ¢ "(UX V) = A-'(U) 
(\w-'(V) is open in H, and ¢ is continuous. Now let U C H be open and 
y € ¢(U). By an argument used in the proof of Theorem 8, we can assume 
that y = $(x), where x € U\G, for some a € A. Since x = @, — (€4 — x), 
there exist neighbourhoods V, W, of x, ¢2, respectively, such that W — (W—V) 
C U. Let V’ = h(V) and W’ = WC\E. Then V’ X W’ is open in G X E, 
and clearly y € V’ X W’. If v € V and eg € W, then eg — (eg — v) € U, 
h(es — (eg — v)) = h(v), and wes — (es — v)) = eg. Hence, 


(h(v), es) = o(es — (es — v)) € o(U), 


and V’ X W’ C o(U). Therefore, ¢ is open. 
A further consequence of the continuity of w is that, in a 7, topological 
half-group H, each S, is closed in H. 


THEOREM 10. Jf H is a connected, T, topological half-group with a finite 
number of idempotents, then H is a (topological) pseudo-group. 


Proof. We have w(H) = E, whence E is connected and 7; in its relative 
topology. Since a finite 7, space is discrete, E reduces to a single point. 


CoroLiary. If, in addition, the right cancellation law holds in H, then H is 
a (topological) group. 


To show the necessity for the assumption that E be finite, let H = [0, 1] 
with the relative Euclidean topology and the structure of the simple half- 
group mentioned in § 2. Evidently H is a compact, connected, 7; topological 
half-group in which every element is an idempotent. 
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ON THE CLASS NUMBER OF REPRESENTATIONS 
OF AN ORDER 


IRVING REINER 


1. Introduction. We shall use the following notation throughout: 
= Dedekind ring (5). 


multiplicative group of units in R. 
class number of R. 

= quotient field of R. 

= prime ideal in R. 

R, = ring of p-adic integers in K. 


ew Ae ee 
ll 


We assume that h is finite, and that for each prime ideal p, the index (R:p) is 
finite. 

Let A be a finite-dimensional separable algebra over K, with an identity 
element e (4, p. 115). Let G be an R-order in A, that is, G is a subring of A 
satisfying 

(i) e € G, 

(ii) G contains a K-basis of A, 

(iii) G is a finitely-generated R-module. 

By a G-module we shall mean a left G-module which is a finitely-generated 
torsion-free R-module, on which e acts as identity operator. An A-module 
is defined analogously, replacing R by K. We shall assume, unless otherwise 
stated, that K is a splitting field for A; thus, the only possible A-endomor- 
phisms of an irreducible A-module X are the scalar multiplications x — ax, 
x € X, where a € K. 

As in (3), we may form the non-zero ideal g C R, defined as the inter- 
section of the ideals which annihilate the one-dimensional cohomology groups 
H(G, T), where T ranges over the set of two-sided G-modules. (In the special 
case where G = R(Il) is the group ring of a finite group II, the ideal g is the 
principal ideal generated by the group order (II :1).) Let P = {pi,..., pi} 
be the set of distinct prime divisors of g, and set 


(1) =I] 9. 
peP 


For any G-module M, let KM be the A-module which consists of the 
K-linear combinations of the elements of M. If we set A, = R,G, we may like- 
wise define the A,-module M, = R,M. Two G-modules M and N are said to 
be in the same genus (notation: M vy .V) if and only if for each p, the modules 
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M, and N, are A,-isomorphic. As is shown in (7), Mv WN if and only if 
KM = KN and M, = N, for each p € P. 

For any A-module L’, let S(L’) be the collection of G-modules L for which 
KL = L’. Suppose that S(L’) splits into r, genera, and into rg classes under 
G-isomorphism. As is shown in (6; 7; and 9), both r, and rg are finite. 
The purpose of this paper is to consider the relation between r, and rg. For 
the special case where L’ is irreducible, Maranda (7) has shown that rg = hr,. 
We shall restrict ourselves to the case where the irreducible constituents of 
L’ are distinct from one another. If L’ has k distinct irreducible constituents, 
we shall prove 
(2) rq > h'r,. 


Further, we shall show that equality holds provided that 


(3) For each a € R such that (a) + g = R, there exists 8 € u for which 
8 = a (mod g*—'). 
Finally, we shall obtain formulas for r, and rg in the special case where 


k = 2. These formulas will show that if condition (3) fails, then »_ may exceed 
h*r, for this case. 


2. Binding homomorphisms. In this section, we shall drop the hypo- 
thesis that K is a splitting field for the algebra A. Let L be a G-module 
which contains a submodule M, and assume that M is an R-direct summand 
of L. Define N = L/M to be the factor G-module. Every element of L is 
then uniquely representable as an ordered pair (n,m), n © N, m © M, 
where the structure of L as R-module is given by 


(4) (n,m) + (n’,m’) = (n+ n',m +m’), a(n,m) = (an, am), 

for n,n’ € N, m,m' € M,a € R. Further, the action of G on L is given by 
(5) g(n,m) = (gn, A,(m) + gm), g © G, where A, © Home (N, M). 
Let A :G— Hom, (N, M) be the R-homomorphism defined by g — A,. The 
condition (gh)(n, m) = g(h(n, m)) is equivalent to 

(6 Agn(m) = gA,(m) + A,(hAn), ghee G,ne N. 

Call A € Hom, (G, Hom, (N, M)) a binding homomorphism if (6) holds, and 
let B(N, M) be the R-module consisting of all binding homomorphisms relative 
to V, M. The R-G-module L is then completely determined by equations (4) 
and (5), once an element A € B(N, M) is fixed. Let us denote this module L 
by (N, M; A). 

It is convenient to turn Hom, (NV, M) into a two-sided G-module T by 
defining 

(gt)(m) = g(t(m)), (tg)n = t(gn), g& G,n © N,t € Hom,(N, M). 
We may then characterize B(N, M) as the set of all A © Homes (G, T) for 
which 
(7 Ag, = gA, + Ayh, g. he G. 
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Now fix t € T, and define A € Hom, (G, T) by 
A, = gt — ¢g, géG. 


We find readily that A € B(N, M). Let B’ (N, M) be the R-module con- 
sisting of all the binding homomorphisms so obtained by letting ¢ range over 
all elements of T. Define the R-module 


C(N, M) = B(N, M)/B’(N, M). 


From (9) we know that C(N, M) contains only finitely many elements. 
Furthermore, from the definition of the ideal g, we have 


g- B(N, M) C B’(N, M) 


for any N, M. Finally, if [A] denotes the class A + B’(N, M) of the element 
A € B(N, M), then we have: 


[A] = [A’] = (N, M; A) & (N, M; A’). 


In fact, if ¢ € T is such that A,’ — A, = gt — ég, g € G, then the map (n, m) 
— (n,m — tn) gives the desired isomorphism. 


In the above discussion, replace R by R,. If L* is an A,-module which 
contains a submodule M* as R,-direct summand, then L* = (N*, M*; A*), 
where N* = L*/M*, and where 


A’ : A, >Home, (N*, M*). 


is an R,-homomorphism satisfying A*,, = xA*y + A*,y, x,y € A>. Define 
B(N*,M*), B’(N*, M*) and C(N*, M*) as above. For A* € B(N*, M*), again 
let [A*] = A* + B’(N*, M*). If y() is defined as in (1), we have 

(8) 2’ B(N*, M*) C B'(N*, M*) 

where = is an element of p such that x ¢ p”. 

Now let N, M be G-modules, and let N,, M, be the corresponding 4,- 
modules. There is a natural isomorphism of B(N, M) into B(N,, M,) which 
may be described as follows: for each A € B(N, M) and each g € G, the 
map A, € Hom,(N, M) may be extended in a unique manner to an element 
of Homg,(N,, M,); we may then define A, for each x € A, by linearity. In 
this way, A is extended in a unique manner to an element A? € B(N,, M,). 
The map A-— A? carries B’(N, M) into B’(N,, M,), and so induces an 
R-homomorphism of C(N, M) into C(N,, M,). 

We may now define an R-homomorphism 

@: C(N,M)—> 2, CW», M,) 


by means of 


o[ A] = ([A"],..., [A74)). 
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From (8), we know that @ has kernel 0. We shall in fact show that ¢ is 
an isomorphism “onto.” 


THEOREM 1. 
C(N, M) = >> C(N,, M,). 
peP 


Remark. A slightly different version of this was first proved by deLeeuw 
(1). We shall not use the results of (8), but instead shall give a self-contained 
proof of the theorem. 


Proof. We show firstly that the ¢ is an “‘onto’’ mapping. For each p € P 
suppose an elernent 2? € B(N,, M,) chosen. We must prove the existence of 
an element A € B(N, M) such that [A?] = [07],p € P. Let T = Hom,(N, M), 
and let us set 

T, = Homeg,(N,, M,) = R, Homg (N, M) = R,T 


for each prime ideal p. 
For each p € P, we may choose an element x € p such that x ¢ p”, and such 
that x does not lie in any other prime ideal in the set P. Set 


vp). 
a= [|]; 
peP 


then a € R, and for each p € P we may write 


a=n"d, d,€R, d,=unitin R,. 
Define the integral ideal 6 by 
(a) = - I] r”. 
peP 


Then 6 is not a multiple of any of the prime ideals in P. 
We now make use of equation (8) to deduce that for each p € P, there 
exists an element uw? € 7, such that 


a: = gu’ — u’z, g €G. 


On the other hand, T is a finitely-generated R-module, so there exist elements 
ti,...,t, € T such that 


T = Rt; +...+ Rt, 
whence 
ie = Ryhi + eee + R,t:. 


We may therefore write (for p € P) 


u? — > Bits, 8 € R,. 


t=—1 


Let us now choose a;,...,a, € R such that 


a, = 62 (mod #"R,), peEP, a,=0(modbd). 
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t=a' > ad, € KT, 
i=1 
and define A € Hom,(G, KT) by 
A, = gt — ég, g €G. 
We shall show that this is the desired A, that is, A € B(N, M), and [A?] = [07] 
for p © P. For p € P we have 
a(Q— A,)=g’—vg, g €G, 
where 
v=’ —at= Zz (8% — ay)ty. 
i=1 
From the way in which the a; were chosen, we may therefore write 
v= 2" dw’, 
where w’ € 7,, and thus 
 — A, = x” (gw” — w’g),g € G. 
This proves that for each p € P, 
a — A” € x” B(N,, M,) C B’(N,, M,), 
and shows incidentally that 
(9) A€T%> 2€6G €P. 
On the other hand, we note that for each prime ideal g ¢ P, the elements 


a a~'a, all lie in R,, and hence 


A, € T,, g €G. 


Coupled with (9), this implies that 
A, € t) Ty, +4 € G, 
¢ 


where g’ ranges over all prime ideals. The above intersection is precisely T, 
and so A € Hom,(G, 7). That (7) holds follows at once from the definition 
of A; consequently, A € B(N, M). This completes the proof that ¢ is “onto.” 

In order to show that ¢ is an isomorphism, let Q € B(N, M) be such that 
a € B’(N,, M,) for all p € P; we must show that @2 € B’(N, M). Since 
2? € B’(N,, M,), there exists for each p € P an element uw? € 7, such that 


OF = gu” — us, g €G. 


By the preceding construction (with a = 1), there exists A € B’(N, M) (since 
now ¢ © T) such that 


Av = @(modr”T7,), g €G. 











ee 
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Therefore 


A-—-Q€ gB(N, M) C B’(N, M), 
which shows that 2 € B’(N, M). 


Coro.uary. If N, N*, M, M* are G-modules such that N vy N* and M vy M*, 
then C(N, M) & C(N*, M*) as R-modules. 


More generally, let 
i 7 i > Li D (0) 


be a set of G-modules such that each is an R-direct summand of its prede- 
cessor. Define NV; = L;/Li4; to be the factor G-module. Then as above, 


every element of L; is uniquely representable as an ordered k-tuple (m, .. . , Ny) 
n, © Ny, where 
’ ’ ’ ry 
(m1,..., My) + (m1,..., Me) = (0, + j,..., Me + ME), 
a(m;,...,M%,) = (am,..., an, ) 


for n,n; © Ny, a € R. The action of G on L; is given by 
12 lk k—1.k 
g(m,..., M) = (gms, gmo + A omy,..., 2% + A oti t+... + Ay * m-1), 


where each A,‘ € Homg(N,, N;), and where the R-homomorphisms A” : 
g— A,“ satisfy conditions analogous to (7). Let B(N,,...,.- V,) denote the 
set of systems {A‘’} satisfying these conditions. We denote the module L, by 
the symbol (N,,..., Nx; {A*}). 


3. Isomorphisms of modules. Throughout this section, we fix an A- 
module L’ with a composition series. 


L=91:5...)&) (0), 


and let NV,’ = L,’/Lii1’. We assume here that NV,’ and N,,' are not isomorphic 
for 1 # j, and further that K is a splitting field for A. For any L © S(L’), 
the A-module KL will have a composition series 


iiaeif Dif |)...J te 2 ©) 
in which Ly’/Ligi"” = N¢. Setting L, = Li’ (\ L, we see that L;, is a G- 
submodule of L for which KL, = L;’. Furthermore, L,,; is a pure R-sub- 
module of L,, and therefore (by 5) is an R-direct summand of L,;. Put V,; = 
L/L; then KN, = Ni, and 
L = (N,,...,. Nx; {A”}) 
for some {A‘} € B(N,,..., Nz). 
LemMA 1. Let M,, N;, € S(N4), 1 < i < Rk, and suppose that 


(Mi, ...,Ma; {A“}) & (Ni... , Ne; {2%}). 
Then M,=Niz 1 qt<ck. 
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Proof. (A modified version of this is given in (2).) It suffices to prove 
that if (N, M; A) &(N,M;4), where KN2KN and KM=KM, and 
where KN and KM have no common irreducible constituent, then M > M 
and VN & N. Once this is established, a simple induction argument completes 
the proof. 

Suppose that 6: (N, M; A) = (N, M; A) is given by 

6(n, m) = O0(n, 0) + 0(0, m) = (0;(m), v(nm)) + (u(m), 62(m)), 


where 


6, € Hom,(N, N), » € Homa(N, M), » € Homa(M, N), 62 € Home(M, M). 
From 6g(n,m) = g0(n, m) we obtain at once 

(10.1,10.2) 6:g + wA, = gh, ug = gh, 

(10.3,10.4) AA: + gv = vg + O2A,, Og = Ayu + gOe. 


From (10.2) we have nu € Homg(M, N), and hence » = 0, since by hypo- 
thesis KM and KN have no common irreducible constituents. Equations 
(10.1) and (10.4) then imply that 6, € Homg(N, N) and @. € Homg(M, M). 
Since @ is an isomorphism of (N, M; A) onto (N, M; A), we find readily that 
6,:N2N and .:M=M. 


LemMMA 2. Let (Ni, ..., Ne; {A%}) and (Ni, ..., Ne; {2*}) be G-isomorphic 
modules in S(L’), where N; € S(N,’). Then there exist units B;,..., By € U, 
and homomorphisms t,, © Homg(N;, N;), such that the isomorphism between 
these G-modules is given by 


(m1, ... , Me) —> (Bitty, Bote + tiem, ... , Bette + tigts +... + tees, 2-1). 


Proof. From the proof of the preceding lemma, we find that the isomor- 
phism must be given by 


(m1, . . «5 Mx) —> (O14, Ooms + trom, ..., Oymy + tigi +... + tes xMe-1), 


with each 6,: N; = N, and each t,, € Homg(N,, N;). Since KN, is an abso- 
lutely irreducible A-module, 6, must be given by scalar multiplication by a 
unit of R. This completes the proof. 


If U, V are R-modules, and f;, fe € Hom,(U, V), we shall often abbreviate 
the congruence f; = f2 (mod g* Hom,(U, V)) as fi: = f2 (mod g*). A similar 
notation will be used for R,-modules. 


Lemma 3. Let M,,..., M, be G-modules, not necessarily irreducible, and let 
L = (Mi,...,M;{A%}), L= (,..., Me; {9%}) 
be G-modules for which 
A‘ = 2” (mod 9"), l<i<j<k, 
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where n is a fixed integer > k — 1. Then there exists a G-isomorphism 6: L = L 
such that @ = I (mod g"-**"), where I: LL = L is the R-isomorphism given by 
(m1, . . . » My) —> (My, ... , Mp). 


Proof. The result is trivial for k = 1; let k > 1, and assume the result 
holds at k — 1. Let us set 


A= (Me...,Me &..-., a ™, 


R= (,.....aes@...-. 8. 


From the induction hypothesis we deduce the existence of a G-isomorphism 
6,: A = A such that 


6, = I (mod g***”). 
The map (m,, 5) — (m,, 6:6), where m, € Mi, 6 € A, then gives a G-iso- 
morphism 
6, : (My, A; A”,..., A™) & (Mi, 4; A”,..., A”) 
for some (A2*,..., A”) € B(M,, A), and we have 
6, = I (mod g"**”). 
Now set 
Re(Pisc.8 kh Go @,....@. 


Then we see that both A and @ are elements of B(M,, A), and that A = Q 
(mod g"~***). By considering this congruence for the powers of the prime 
ideals dividing g, the method of proof of Theorem 1 shows the existence of 
an element W € Hom,(M,, A) such that 


(A — Q), = gW — Wz, g €G, 


and where, furthermore, W = 0 (mod g**+"). The map (m, 5) — (m,, 5— Wm) 
then yields a G-isomorphism @; : (M,, A; 2) = (M,, A; A), where 


6; = I (mod g**"). 
Therefore 
6502 : (Mi, A; A’,..., A™) — (Mi, J; 2”,..., 2") 


is a G-isomorphism of L onto L such that 
630, = I (mod g”**"). 


4, Integral classes and genera for modules with two distinct con- 
stituents. Throughout this section, we suppose that L’ is an A-module with 
two distinct irreducible constituents NV’ and M’; we assume again that K is 
a splitting field for A. Let S(L’) be partitioned into rg classes under G-iso- 
morphism, and into r, genera. We shall obtain formulas for rg and r,. 
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Lemma 4. Let N € S(N’), M € S(M’). Then (N, M; A) & (N,M:;4) if 
and only if there exists 8 € u such that [A] = BA}. 


Proof. From Lemma 2 we deduce the existence of units 8, 82 € u, and 
of ¢ € Homeg(N, M), such that the isomorphism (NV, M; A) = (N, M: A) is 
given by (n,m) — (8.n, Bom + tn). This implies 


A, = Bi'B2A, + g(6r't) — (Bite, g EG. 


Setting 8 = 8:-'B2, we have [A] = A[A]. Conversely, starting from such a 
relation, we may reverse the steps to obtain an isomorphism of the modules. 


Lemma 5. Let N € S(N’), M € S(M’). Then (N, M; A) v (N, M; A) if and 
only if there exists an element a € R such that (a) + g = R and [A] = afAl. 


Proof. Let (N, M; A) v (N, M; A). As in the preceding proof, we deduce 
that for each p € P, there exists an element a, which is a unit in R, such 
that the classes [A”] and [A”] in C(N,, M,) are related by 


[A] = a,[ A’]. 


Choose a € R such that a = a, (mod p”™) for each p € P; then (a) +49 =R 
Furthermore, (a — a,)B(N,, M,) C B’(N,, M,), so that 


alA"]}=a,[A"], p € P. 


Therefore [A?] = [aA] for all p € P, and so by Theorem 1 we have [A] = 
[aA] = afA]. 

Suppose now that S(N’) splits into » genera; according to (7), each genus 
splits into # classes under G-isomorphism. Let us choose representatives of 
the hy classes, say {NV ,j':1 < i < », 1 <7 < A}, so that all the modules with 
the same subscript lie in the same genus. Likewise choose representatives 
{M,':1 <i<u,1 <j <A} of the Ay classes into which S(M’) splits. Let 
(N, M;T) € S(L’), and suppose NV v N,‘, Mv M,’. Then for each p € P, 
there exists an element 


o € B((Ni)»,(M?})>) 
such that 
(N,, M,; 1?) = ((Ni)p,(Mi)p; 2) 


as A,-modules. By Theorem 1, there exists A € B(N,', M,’) such that 
[A?] = [7] for all p € P. Therefore 


(N,M;T)P = (Ni, Mi; A)», p 


fan 


P, 
and so 


(N,M;T) V (Ni, Mj; A). 
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Hence, every module in S(L’) is in the same genus as (V,', M,’; A) for some 
choice of ¢ and 7 and some A € B(N,‘, M,’). Further, 
(Ni, Mi; A) V (NY, M7; A’) 


implies, by the method of proof of Lemma 1, that i = 7’ and j = j’. Let us 
set 


(11) H,, = {(Ni, Mi; A): A € BIN], M))}, 1 <i<eyl<j<u, 


and suppose that H,, splits into r;; genera. Then we have at once 
(12) ro = Dorey. 
7 


On the other hand, any module in S(L’) is G-isomorphic to (NV,‘, M,’; A) 
for some i,j, p, 7 and A. Further, by Lemma 1, two such modules cannot be 
isomorphic unless they have the same set of indices 7, j7, p, . Let us set 


S(i, p;j,0) = {(N,, Mz; A): A € B(N,, M2)}, 


and suppose that S(t, p;j, a) splits into s(t, p;j,¢) classes. Then 
'o = } S(t, pi j, 7). 
1.J3.e.¢ 
However, Lemma 4 states that (V,‘, M,/; A) & (N,‘, M.’; A) if and only 
if there exists 6 € u such that [A] = 6[A]. Furthermore, the Corollary to 
Theorem 1 shows that C(.V,‘, M,’) is (as R-module) independent of p and 


a. Therefore s(i, p;j,0) = s(t, 1;j, 1) for all p and o, and we have 
(13) ro=h >) sey 
i,j 


where s,; = s(t, 1;j, 1) is the number of classes into which H,, splits. 

Before proceeding with the calculation of r;; and s,,;, it will be convenient 
to introduce some notations. For a non-zero ideal a in R, let (a) denote the 
number of residue classes in R/a which are relatively prime to a. Ifa + 6 = R, 
then (ab) = ¢(a)¢(b). Next, let u(a) denote the number of distinct residue 
classes in (u + a)/a; of course, u(a) is a divisor of (a). However, u(a) is 
not a multiplicative function of a, as is seen from the example where K is 
the rational field. 


LemMaA 6. Let VN € S(N’), M € S(M’'),andH = {(N, M; A): A € B(N,M)}. 
Suppose H splits into r genera and s classes. Let d(a) be the number of elements 
in C(N, M) with order ideal a. Then 


r= p> d(a)/ (a), s= > d(a)/u(a), 


a 
both sums extending over all divisors of 4. 


(The order ideal of an element c € C(.V ,M) is {a € R:ac = 0}.) 
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Proof. Let us use the symbol (NV, M; c) to denote the collection of mutually 
isomorphic modules {(N, M; A): A € c}, where c € C(N, M). By Lemma 4, 
(N, M;c) and (N, M;c’) cannot lie in the same genus unless ¢ and c’ have 
the same order ideal. Consider the set of d(a) elements of C(N, M) with 
given order ideal a. For a fixed c in this set, all those c’ of the form ac, where 
a € R is such that (a) + g = R, will yield modules in the same genus as 
those obtained from c. But as « ranges over all elements of R for which 
(a) + g = R, ac gives exactly ¢(a) distinct elements of C(N, M). Therefore 


f= » d(a)/¢(a). 


A similar argument gives the formula for s. 


Let d,(p") denote the number of elements in C(N,, M,) having order ideal 
pb”. Then 


d,(p") = r(p") — r(p"’), 


where 1r(p") denotes the number of elements of C(N,, M,) which are anni- 
hilated by »*. From Theorem 1, 


d(a) = J] d,(p*”), where a = [] p™. 
peP 


peP 


We may therefore write 


r=T] 1b a(0")/4(0%, 


which confirms the resu't in (7) that the number of genera is the product 
over all » € P of the number of classes into which S(L’) splits under A, 
isomorphism. The corresponding multiplicative formula for s fails to hold, 
because u(a) is not multiplicative. 

Applying Lemma 6 to our original problem, we may summarize our result 
as follows. 


THEOREM 2. Let N',...,N” be representatives of the genera into which 
S(N’) splits, and M',..., M* representatives of the genera of S(M’). For each 
divisor a of q, let di;(a) denote the number of elements in C(N‘, M’) having 
order ideal a. Then S(L’) splits into r, genera and rg classes, where 


r, = p> DX 4ij(a)/o(a), ro = h p> DX dis(a) /u(a). 
i,j i.j 
Here, ¢(a) is the number of residue classes in R/a which are relatively 
prime to a, and u(a) is the number of distinct elements of (u + a)/a. 


COROLLARY. We have rg > h’r,, with equality provided that $(q) = u(q). 
Furthermore, if any C(N‘, M’) contains an element of order ideal a, where 
u(a) < o(a), then rg > hry. 
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5. Integral classes and genera in the general case. Now let L’ be an 
A-module with k distinct irreducible constituents, and let K be a splitting 
field for A. We preserve the notation introduced at the beginning of § 3. In 
this section we shall generalize the results given in the Corollary to Theorem 2. 

For each « (l < « < Rk), let {N.4:1 <i < v(x), 1 <7 < h} be a full set 
of representatives of the Av(x) classes into which the set S(N,’) splits; suppose 
these representative modules are so chosen that modules with the same 
indices i and « lie in the same genus. Then every module in S(L’) is of the 
form 


(Wy*,..., Wats {A%}). 
Let S(t:, j1; ~~. 3 %,je) be the set of all such modules obtained by letting 
{A‘} range over all systems in 
DO? .. 038” 
and let this set split into r(i1, j1;... te, je) genera amd s(ts, ji; ~~. + te» je) 


classes. From the Corollary to Theorem 1, we see that r(t;, j1;... 3 te, je) is 
independent of (j:,..., jx), and therefore 


re © h~* ys r (is, jr, pees + tes Je) ‘oe = . S(i1, Jays - ** + tes Je), 


both summations extending over all possible values of the 7’s and j’s. This 
implies the result that 


tq > h'r,. 
Finally, we prove: 
THEOREM 3. If u(g*") = o(q*"), then rg = h'r,. 
Proof. We remark that the hypothesis of the Theorem is simply a restate- 
ment of condition (3) given in the introduction. To prove the theorem, we 


need only show that r(i:, ji; ~~. 3 te, jx) = S(ti, fii. .~ 3 te jx). We simplify 
the notation by letting M, € S(N,’), 1 < « < k. We shall prove that if 


L = (M,,...,Mz;{A%}), LD = (My,..., Me; {A™%}) 
are such that Lv L, then also L = L. 
Since L, = L, for each p € P, Lemma 2 shows the existence of units 
B,,..., 8,” in R,, and homomorphisms 
Ui, € Hom, ((M 1%)», (M;)p) 


such that the isomorphism L, & L, is given by 


(my, ..., mx) — (Bim, Bim2 + tiym,..., Bim, + thm, +... + Ba1..Me-1). 
By the hypothesis of the theorem, we may choose units §;,..., 68, € u such 
that 


6. = 62 (modp*”™), peEP, 1<K«<k. 
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As in the proof of Theorem 1, we may choose homomorphisms w,, € 
Hom g(M,, M;) such that 


w=, modp*™"™, 1ecic<jck, peEP. 
Then the map 


(my, sees mM, ) => (Bym,, Bom, + W211, ...-, Bym, + WymM) — eee oe Wy—1,~My-1) 


gives a G-isomorphism of L onto a module L* where L* = (M,,..., My; {Q*}) 
and Q = A” (mod g*’) for 1 < i <j < k. By Lemma 3 we then have 
L* = L, which completes the proof of the theorem. 

It would be of interest to obtain formulas for rg and r, which generalize 
those given in Theorem 2. 
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